Yeah, the threshold is pretty brutal, but it's achievable. Experimentally, I'd say you need to be under 2-3ms, though even at 1ms you can start to hear some phase differences.
Most of the time, I think my synchronization algorithm is actually sub-1ms, but it can be worse under unstable network conditions.
First, I do clock synchronization with a central server so that all clients can agree on a time reference.
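Roughly, that's an NTP-style exchange: the client timestamps a ping, the server replies with its own clock reading, and the client assumes the one-way delay is half the round trip. A minimal sketch in TypeScript (the WebSocket message shape and field names here are my own illustration, not necessarily the project's actual protocol):

```typescript
// Hypothetical NTP-style clock offset estimation over a WebSocket.
// Assumes the network delay is roughly symmetric in both directions.
type SyncSample = { offset: number; rtt: number };

function estimateOffset(ws: WebSocket): Promise<SyncSample> {
  return new Promise((resolve) => {
    const t0 = performance.now(); // client send time
    ws.addEventListener(
      "message",
      (ev) => {
        const t1 = performance.now(); // client receive time
        const { serverTime } = JSON.parse(ev.data); // server's clock at reply
        const rtt = t1 - t0;
        // If the one-way delay is ~rtt/2, the server clock read serverTime
        // when the client clock read t0 + rtt/2.
        resolve({ offset: serverTime - (t0 + rtt / 2), rtt });
      },
      { once: true }
    );
    ws.send(JSON.stringify({ type: "clock-ping", clientTime: t0 }));
  });
}
```

In practice you'd repeat this several times and keep the samples with the lowest RTT (or filter them), since any single measurement can be thrown off by jitter.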
Then, instead of directly manipulating the hardware audio ring buffers (which browsers don't allow), I use the Web Audio API's scheduling system to have all devices start playing the audio at a specific time in the future.
So a central server relays messages between clients, telling each one when to start and which sample position in the buffer to start from.
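For the playback side, here's roughly what that scheduling could look like with the Web Audio API; the translation from server time to the local AudioContext timeline is approximate, and the parameter names are mine rather than the project's:

```typescript
// Schedule a shared buffer to start at an agreed server timestamp,
// beginning at a given sample position. `clockOffset` is the estimated
// serverTime - clientTime (ms) from the clock sync step above.
function schedulePlayback(
  ctx: AudioContext,
  buffer: AudioBuffer,
  serverStartMs: number, // agreed start time on the server's clock
  startSample: number,   // sample position in the buffer to start from
  clockOffset: number    // estimated serverTime - performance.now() offset
): void {
  const source = ctx.createBufferSource();
  source.buffer = buffer;
  source.connect(ctx.destination);

  // Convert the agreed server time into this client's local clock,
  // then into the AudioContext's own timeline.
  const localStartMs = serverStartMs - clockOffset;
  const delaySec = Math.max(0, (localStartMs - performance.now()) / 1000);
  const when = ctx.currentTime + delaySec;

  // Web Audio schedules relative to ctx.currentTime with sub-ms precision;
  // the second argument is the offset into the buffer, in seconds.
  source.start(when, startSample / buffer.sampleRate);
}
```

Once `source.start()` is scheduled, the browser's audio thread handles the actual timing, so ordinary JavaScript jitter largely stops mattering.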
Interesting. Feels like this might still have some noticeable tens-of-milliseconds of latency on Windows, where the default audio drivers have high latency. The browser may intend to play the sound at time t, but when it calls the Windows audio API to play the sound, I'm guessing it doesn't apply a negative time offset?
So it doesn't need to use the microphone? I'd guessed as much from the "works across the ocean" comment and this description. I would have thought you'd listen to the mic and sync based on the surrounding audio somehow, but it's good to know that's not needed.
Thank you for the kind words! Yeah, I think it gets a lot more complicated once you start dealing with external speaker hardware. It pretty much only works with the device's built-in speakers at the moment.
The instant you add wireless speakers (e.g. Bluetooth) or any other significant delay between commanding playback and the sound actually coming out, the latency becomes audible.
I primarily built this for group in-person listening, and that's what the spatial audio controls are for. But what's interesting is that, since it only requires a browser, it works across the internet as well. You can guarantee that you and someone else are listening to the same thing even across an ocean.
Someone brought up the idea of an internet radio, which I thought was cool: you could see a list of all the rooms people are in and tune in to exactly what they're jamming to.
> You can guarantee that you and someone else are listening to the same thing even across an ocean.
How can you guarantee that? NTP fails to guarantee that all clocks are synced inside a datacenter, let alone across an ocean. (I haven't read the code yet.)
EDIT: The wording got me: "Guarantee" and "Perfect" in the post title, and "Millisecond-accurate synchronization" in the README. Cool project!
What's more, the speed of light puts a hard cap on how simultaneous you can be. Wolfram Alpha reckons New York to London is 19ms in a vacuum, and more over fibre.
Going off on a tangent: back in the days of Live Aid, they tried doing a transatlantic duet. It turns out to be physically impossible, because if A sings when they hear B, then B hears A at least 38ms too late, which is too much for a human to handle and still make music.
It's an easier problem than the duet. If the round trip is 38ms, you can estimate that the one-way latency is 19ms. You tell the other client to play the audio now, and you schedule your own playback for 19ms in the future.
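Something like this toy sketch (the message names are made up, and a real implementation would schedule against the audio clock rather than setTimeout):

```typescript
// Toy illustration of the RTT/2 trick: measure the round trip, assume the
// one-way delay is half of it, tell the peer to start immediately on receipt,
// and start locally after the same delay.
async function startTogether(peer: WebSocket, play: () => void): Promise<void> {
  const t0 = performance.now();
  peer.send(JSON.stringify({ type: "ping" }));
  await new Promise<void>((resolve) =>
    peer.addEventListener("message", () => resolve(), { once: true })
  );
  const oneWayMs = (performance.now() - t0) / 2; // e.g. 38ms RTT -> ~19ms

  peer.send(JSON.stringify({ type: "play-now" })); // lands ~oneWayMs later
  setTimeout(play, oneWayMs);                      // start locally in step
}
```

setTimeout only gives you millisecond-ish precision at best, which is part of why scheduling through the audio clock (as described above) matters.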
That's assuming a standard OS, hardware, and drivers can manage latency with that degree of precision, which I have serious doubts about.
In a duet, your partner needs to hear you now and you need to hear them now. With pre-recorded audio, you can buffer into the future.