Tantrum Audio logo
Learn

Studio Monitoring, Explained

A growing library of clear, technically accurate guides on studio monitoring, room acoustics, mix translation and loudspeaker design โ€” written for engineers and producers who want to understand the why, not just the what.

Monitoring Fundamentals

What Is a Point-Source Studio Monitor?

A point-source studio monitor reproduces its entire frequency range from a single point in space, rather than splitting the signal between separate woofer and tweeter drivers mounted apart from each other. In practice this means a single full-range driver, or a coaxial arrangement where the tweeter sits at the acoustic centre of the woofer. The aim is a coherent wavefront, with all frequencies arriving at the listening position at the same time and from the same place. This improves phase coherence, time alignment and stereo imaging, and tends to make mix decisions translate more reliably to other playback systems, particularly in small rooms.

Read โ†’

Understanding Stereo Imaging

Stereo imaging is the perceived position, width and depth of sounds across the space between two monitors. It exists because the brain locates sound using differences in level and arrival time between the ears, and a stereo pair recreates those cues so a sound panned between the speakers appears at a point in space, a phantom image. Accurate imaging depends on a symmetrical speaker setup, controlled early reflections, and a coherent, phase-accurate monitor. When those conditions are met, panning, width and depth decisions are easy to judge and translate to other systems. When they aren't, the image smears, drifts or collapses.

Read โ†’

What Is a Studio Monitor?

A studio monitor is a loudspeaker designed for accurate reproduction rather than for a pleasing sound. Its job is to reveal a recording as it is, including its flaws, so that mixing, editing and mastering decisions made on it hold up on other systems. That goal shapes everything about a monitor, including a flat frequency response, low distortion, controlled directivity and, in active designs, amplification matched to the drivers. Monitors come in different sizes and configurations for different rooms and distances, but the defining idea is consistent. A monitor is a measurement instrument for your ears, not a speaker chosen to make music sound its best.

Read โ†’

Nearfield vs Midfield Monitors

Nearfield and midfield describe how far a monitor is designed to be listened to, and that distance changes everything about the choice. Nearfield monitors are meant for close listening, typically under about 1.5 metres, so the direct sound dominates and the room has less influence. Midfield monitors are larger and intended for greater distances in bigger, acoustically treated rooms, where they deliver more output and extension. Most home and project studios are nearfield setups because the room is small and untreated enough that sitting close is the most reliable way to hear accurately. Midfields belong in rooms built to support them.

Read โ†’

What Makes a Studio Monitor Accurate?

An accurate studio monitor reproduces a recording with as little change as possible, so what you hear matches what was recorded. That accuracy is the sum of several measurable qualities rather than one figure. A flat frequency response keeps the tonal balance true, low distortion keeps the output a faithful copy of the input, good time and phase behaviour preserves transients and imaging, and consistent directivity makes the sound the room reflects a faithful copy of the direct sound. No monitor is perfect, and the room can undo good engineering, so accuracy is best understood as a combination of the monitor's design and how it is set up.

Read โ†’

Why Frequency Response Matters

Frequency response describes how evenly a monitor reproduces frequencies across the audible range. A flat response reproduces all frequencies at close to their recorded level, so you hear the true tonal balance of a mix. When the response is coloured, with boosted or suppressed regions, you compensate for that colouration in the mix in a way that is wrong on other systems, which is why response is central to translation. Reading a response graph means looking at how flat the line is, over what range, and within what tolerance, and considering off-axis behaviour as well as on-axis, because the room reflects the off-axis sound back to you.

Read โ†’

Mix Translation

How to Improve Mix Translation

Mix translation is how well a mix holds together when played on systems other than the one it was made on. Poor translation is rarely random. It almost always traces back to a small set of causes, including an untreated room that distorts what you hear in the low end, monitors or headphones with a hyped response, mixing too loud, and over-reliance on a single playback system. Fixing translation means fixing those causes in order. Get the monitoring chain honest, calibrate your level, reference against known material, check mono and loudness, and confirm decisions on more than one system before you commit.

Read โ†’

Why Mixes Sound Different on Different Systems

A mix sounds different on different systems because every playback chain reproduces it differently. Speakers and headphones have different frequency responses and bandwidths, rooms add their own colouration, small devices lose the low end and often play in mono, and listening level changes how the ear weighs bass and treble. None of this changes the file; it changes what reaches the listener. Understanding the specific causes shows why a mix made on one accurate system can still need checking elsewhere, and which differences you can control through honest monitoring, referencing and checking across systems, and which you simply have to account for.

Read โ†’

Why Some Monitors Translate Better Than Others

Two monitors can both sound fine yet lead to very different results once a mix leaves the studio. The ones that translate well share a few qualities. They have an honest, reasonably flat response that reveals the true balance rather than flattering it, so the corrections you make are corrections to the mix. They are coherent and image clearly, so width, panning and mono decisions are reliable. And they have consistent directivity, so the room reflects a faithful copy of the direct sound and your decisions hold across positions. A monitor translates better when it tells the truth, not when it sounds the most impressive.

Read โ†’

Small-Room Acoustics

Why Small Rooms Cause Bass Problems

Small rooms cause bass problems because low-frequency sound has wavelengths comparable to, or larger than, the room's dimensions. At those frequencies the room stops behaving like open space and starts resonating. Standing waves (room modes) build up between parallel surfaces, creating peaks where bass is exaggerated and nulls where it almost disappears, with the effect changing dramatically from one seat to the next. Reflections from nearby boundaries also cancel parts of the low end (SBIR). The smaller the room, the higher in frequency these problems reach and the more widely spaced and audible the individual modes become, and that is why bass is the hardest thing to get right in a small studio.

Read โ†’

Understanding Room Modes

Room modes are resonances that build up when low-frequency sound reflects between a room's surfaces and reinforces itself, creating fixed regions where bass is exaggerated and others where it nearly disappears. They are the main reason bass sounds uneven and changes from one seat to the next, and they dominate the low end of every enclosed room. Modes come in three types, axial, tangential and oblique, with axial modes doing most of the audible damage. You can estimate the strongest modes from the room's dimensions, and you can reduce their effect with placement, bass trapping and, to a degree, correction, though no approach removes them entirely.

Read โ†’

Speaker Placement in Small Studios

Speaker placement is the cheapest and often the most effective way to improve monitoring in a small studio. Where the monitors and your seat sit relative to the walls sets how strongly room modes are excited, where boundary cancellation lands, and how stable the stereo image is. The core principles are a symmetrical setup, an equilateral triangle between the speakers and your head, tweeters at ear height, and a sensible distance from the front and side walls. Getting these right costs nothing and changes the problem before you reach for treatment or correction, which is why placement comes first.

Read โ†’

Desk Reflections Explained

A desk reflection is the sound from your monitors that bounces off the desk surface and reaches your ears shortly after the direct sound. Because the reflected path is slightly longer, the two combine out of step and cancel at some frequencies and reinforce at others, an effect called comb filtering. The audible result is a dip in the response, usually in the lower midrange, plus a smearing of the stereo image. The cause is geometry, so the fixes are geometric: raise and decouple the monitors, tilt them, move them clear of the desk, and reduce the reflective area between you and the speakers.

Read โ†’

Buying Guides

Best Studio Monitors for Small Rooms: What to Look For

The best studio monitor for a small room is the one matched to the room, not the biggest or highest-output pair you can afford. Small rooms are listened to at close range and already struggle with bass, so the criteria that matter shift. They include a compact woofer that doesn't overload the room's modes, a controlled (often sealed) low end, coherent point-source-style imaging at short distances, appropriate rather than maximum SPL, controlled dispersion to limit reflections, and onboard DSP for response and room adjustment. This guide explains each criterion so you can evaluate any monitor against your own room rather than relying on a list of model names.

Read โ†’

Best Compact Studio Monitors: What to Look For

A compact studio monitor is a small monitor intended for desks, small rooms and nearfield listening. The useful point is that small does not mean inaccurate. A well-designed compact monitor can image precisely and reproduce the midrange and treble faithfully, and at the short distances small rooms are used at, its size is an advantage rather than a limitation. The trade-offs are low-end extension and maximum output, both of which follow from a small driver and cabinet. This guide covers the criteria that separate a good compact monitor from a toy, and when compact is the right choice rather than a compromise.

Read โ†’

Best Studio Monitors Under ยฃ1000: How to Spend It

Around ยฃ1000 is enough for genuinely capable monitoring, but the result depends more on how you spend it than on which pair you buy. Putting the whole budget into the most expensive monitors you can afford, in an untreated room on a desk, gives a worse result than spending less on a correctly sized, accurate pair and using the rest on stands, treatment and, if needed, a subwoofer. This guide treats ยฃ1000 as a system budget. It covers how to allocate it, what to prioritise in the monitor itself, and why the room and setup decide whether the monitors can perform.

Read โ†’

Comparisons

Point-Source vs Traditional Studio Monitors

Studio monitors mostly fall into two camps. A traditional two-way monitor splits the signal between a separate woofer and tweeter, mounted apart on the front panel, with a crossover dividing the work between them. A point-source monitor reproduces the whole range from a single point, either with one full-range driver or a coaxial design where the tweeter sits at the centre of the woofer. The two-way generally offers more low-end extension and output for the money, while the point source offers inherent time alignment, better phase coherence and more stable imaging. Neither is universally better. The right choice depends on your room, listening distance, the music and your priorities.

Read โ†’

Studio Monitors vs Hi-Fi Speakers

Studio monitors and hi-fi speakers are both loudspeakers, but they are designed for opposite goals. A studio monitor aims for accuracy, reproducing a recording as it is so problems are easy to hear and fix. A hi-fi speaker aims for enjoyment, often with a voicing that flatters most music. That difference shows up in frequency response, distortion behaviour, directivity and intended listening distance. Neither is better in the abstract. For mixing, editing and mastering, the monitor's honesty is essential. For relaxed listening, a hi-fi speaker's pleasing balance may be more satisfying. Problems arise mainly when one is used for the other's job.

Read โ†’

Active vs Passive Studio Monitors

An active studio monitor has its amplifier (or amplifiers) built into the cabinet and matched to its drivers, and you feed it a line-level signal. A passive monitor has no internal amplification and needs a separate power amp chosen and matched by the user. The active approach lets the designer optimise the amp, crossover and any DSP for the specific drivers, which is why most modern monitors are active. Passive designs offer flexibility in amplifier choice and can simplify the cabinet, but they put the matching burden on the user. For most studios, active monitors are the simpler and more predictable choice.

Read โ†’

Coaxial vs Traditional Monitor Designs

A coaxial monitor places the tweeter at the acoustic centre of the woofer, so the two drivers share an axis and radiate from effectively one point. A traditional two-way mounts the woofer and tweeter at separate positions on the baffle. Both are two-way designs with a crossover, but the coaxial behaves as a point source, with coincident drivers, inherent time alignment and even directivity. That coherence comes at the cost of engineering complexity, since the woofer cone acts as part of the tweeter's path and the two drivers interact. The traditional spaced layout is simpler and cheaper to build well, at the cost of coherence around the crossover.

Read โ†’

Sealed vs Ported Studio Monitors

A sealed monitor uses a closed cabinet, while a ported (bass-reflex) monitor adds a tuned vent that uses the air inside the box to extend low-frequency output. The port buys roughly an extra half-octave of bass and higher output for a given driver and box size, at the cost of a steeper roll-off, more low-frequency group delay, and possible port noise at high levels. A sealed box gives a gentler roll-off and tighter transient and phase behaviour, trading some deep extension. For small rooms, where bass is already uneven, the sealed box's control is often the more useful choice, with a subwoofer added when deep extension is needed.

Read โ†’

Acoustics

Understanding Phase Coherence

Phase coherence describes how faithfully a monitor preserves the relative timing of frequency components as it reproduces a signal. A perfectly phase-coherent system delays all frequencies equally (or not at all), so a transient arrives intact. Real monitors introduce frequency-dependent phase shift, quantified by phase response and group delay. The main sources are crossovers, physical driver offset and ported enclosures. Most of these behave as minimum-phase systems, meaning magnitude and phase are linked, while linear-phase DSP can flatten phase at the cost of pre-ringing and latency. This article explains the theory, where coherence is lost, how it's measured via impulse and step response, and what is genuinely audible.

Read โ†’

Understanding Speaker Directivity

Directivity describes how a loudspeaker's output changes with the angle you listen from. A speaker does not radiate every frequency equally in all directions, and the way its response narrows or widens with frequency is its directivity. This matters because the sound radiated off-axis reaches you as room reflections, so if the off-axis response differs in tone from the on-axis sound, those reflections colour what you hear even when the direct sound measures flat. The useful goal is consistent directivity, where the off-axis response keeps the same shape as the on-axis response, rather than simply wide or narrow dispersion.

Read โ†’

Impulse Response Explained

The impulse response is what a system outputs when fed a single short spike. For a monitor or a room it is a complete description of linear behaviour, because its Fourier transform gives both the frequency response and the phase response, and its shape in time reveals reflections, ringing and the arrival of each driver. A clean, narrow impulse means a coherent, well-controlled system, while a smeared one with later spikes shows reflections or driver misalignment. It is measured with a calibrated microphone and a sweep, and it underlies frequency-response graphs, waterfall plots and room correction.

Read โ†’

What Is Time Alignment?

Time alignment means that all of a monitor's output reaches the listening position at the same instant, so a transient is reproduced as a single clean event rather than smeared across time. A single full-range driver is time-aligned by construction because there is only one source. A two-way is not automatically aligned, because the woofer and tweeter sit at different positions and their acoustic centres are at different distances from the listener, so their outputs arrive at slightly different times. Designers correct this with sloped or stepped baffles, careful crossover design, or DSP delay, while point-source and coaxial designs largely avoid the problem.

Read โ†’

Subwoofers

Should You Use a Subwoofer in a Small Studio?

A subwoofer adds low-frequency extension below what compact monitors reach, and it lets you place the bass source independently of the main speakers. In a small studio that can be useful, but a sub is not an automatic upgrade. It still excites the same room modes, and poorly integrated it adds boom and confusion rather than clarity. Whether you should use one depends on your monitors, your room and your work. The honest position is that a subwoofer helps when it is well placed, level-matched, time-aligned and crossed over sensibly, alongside room treatment, and that without that care it often makes a small room sound worse.

Read โ†’

How to Integrate a Studio Subwoofer

Integrating a subwoofer is the process of making it work as one system with the main monitors rather than as a separate source of bass. Four things have to be set correctly: where the subwoofer sits, the crossover frequency where the mains hand over to it, the subwoofer's level relative to the mains, and the phase or time alignment so the two sum cleanly at the crossover. The reliable method is to set each in order and measure as you go, because a subwoofer that is placed, crossed, levelled and aligned well adds even extension, while one that is not adds boom and masks detail.

Read โ†’

What Crossover Frequency Should I Use?

The crossover frequency is the point at which low-frequency content is handed from the main monitors to the subwoofer. It is chosen mainly from how low the mains play cleanly, so the handover sits where the mains begin to roll off, commonly somewhere between 60 and 100 Hz. Around 80 Hz is a frequent starting point because bass below it is hard to localise, which keeps the subwoofer from drawing attention to its position. Smaller mains cross higher, larger mains lower. The room and the crossover slopes also matter, so the right value is set by measurement and listening, not by a single number.

Read โ†’

Sign up to hear more when we release something new

Tantrum Audio logo in footer
Tantrum Audio Instagram Share Widget
Tantrum Audio Facebook Share Widget

ยฉ Tantrum Audio 2026

๐Ÿช Help us improve by allowing analytics cookies?