**Marc Brooker** @marcbrooker · 5d

New blog post on ten years of AWS Lambda, and some thoughts on the Lambda PRFAQ: https://brooker.co.za/blog/2024/11/14/lambda-ten-years.html

brooker.co.za: Ten Years of AWS Lambda - Marc's Blog

Marc Brooker boosted

**Jeremy Daly** @jeremydaly@hachyderm.io · Nov 7

Issue #304 of Off-by-none is out! This week, AWS turns on the pre:Invent firehose, AppSync gives us better #serverless WebSockets, and Neon simplifies row-level security. Plus, we recognize Marc Brooker as our ️ of the week! #offbynone https://offbynone.io/issues/304/

offbynone.io: You get to drink... from the firehose 🚒 - Off-by-none. In this issue, AWS turns on the pre:Invent announcement firehose, AppSync gives us better serverless WebSockets, and Neon simplifies row-level security for Postgres.

**Marc Brooker** @marcbrooker · Nov 2

I'm on the AWS Developers podcast this morning, talking to Julian Wood about AWS Lambda's 10th anniversary, the growth of serverless, and some of what we learned along the way.

https://developers.podcast.go-aws.com/web/podcasts/episode_137/index.html

AWS Developers Podcast · Nov 1 · AWS Lambda: A Decade of Transformation. In this episode of the AWS Developers Podcast, Julian Wood hosts a discussion with Marc Brooker, a distinguished engineer involved in the creation of AWS Lambda. They explore the origins of Lambda, its evolution, and the impact of serverless technology on modern computing. The conversation delves into customer-centric innovations, the challenges of event-driven architectures, and the future of serverless in the context of generative AI. Marc reflects on the journey of Lambda, the lessons learned, and the exciting possibilities that lie ahead for serverless technology.

Continued thread

**Marc Brooker** @marcbrooker · Oct 22

"But what about time-to-market?" has been one of the objections to automated reasoning and formal methods forever, but in many domains they allow us to get to market faster. This is especially true in security, availability, and durability-critical domains.

**Marc Brooker** @marcbrooker · Oct 22

Great new piece from Byron Cook about automated reasoning at AWS, and how we're finding it not only allows us to deliver safer code, but also deliver faster code, and deliver code faster. https://aws.amazon.com/blogs/security/an-unexpected-discovery-automated-reasoning-often-makes-systems-more-efficient-and-easier-to-maintain/

Amazon Web Services · Oct 17 · An unexpected discovery: Automated reasoning often makes systems more efficient and easier to maintain | Amazon Web Services. During a recent visit to the Defense Advanced Research Projects Agency (DARPA), I mentioned a trend that piqued their interest: Over the last 10 years of applying automated reasoning at Amazon Web Services (AWS), we’ve found that formally verified code is often more performant than the unverified code it replaces. The reason is that the […]

Marc Brooker boosted

**Jonathan Yu** @jawnsy@mastodon.social · Sep 30

"In practice, the redundant nature of connectivity and ability to use routing mechanisms to send clients to the healthy side of partitions means that the vast majority of cloud systems can offer both strong consistency and high availability to their clients, even in the presence of the most common types of network partitions (and other failures)."

https://brooker.co.za/blog/2024/07/25/cap-again by @marcbrooker

brooker.co.za: Let's Consign CAP to the Cabinet of Curiosities - Marc's Blog

Marc Brooker boosted

**Jonathan Yu** @jawnsy@mastodon.social · Aug 15

"Increasing memory pressure increases the amount of time it takes for the GC to run, and increases the cost of handling any given request, this increases per-request latency and reduces throughput, this increases the number of requests in flight (and their associated per-request memory), which increases memory pressure."

"[W]ith some collectors, the cost of performing a unit of work can increase by up to 70% as memory pressure increases."

https://brooker.co.za/blog/2024/08/14/gc-metastable.html by @marcbrooker

brooker.co.za: Garbage Collection and Metastability - Marc's Blog

**Marc Brooker** @marcbrooker · Aug 15

New little blog post on garbage collection and metastability: https://brooker.co.za/blog/2024/08/14/gc-metastable.html

brooker.co.za: Garbage Collection and Metastability - Marc's Blog

Marc Brooker boosted

**Lorin Hochstein** @norootcause@hachyderm.io · Aug 9

Reminder: what matters is system behavior, not component behavior. It is no consolation to your users that your server implementation is correct when anomalous behavior results from how the client interacts with the server. https://fediscience.org/@marcbrooker/112927015402323009

Marc Brooker: More great work from Kyle, and another reminder that retrying isn't the safe best-practice that many folks assume it is: https://jepsen.io/analyses/jetcd-0.8.2

**Marc Brooker** @marcbrooker

I've been learning Lean recently to add to my formal toolkit, and it's been a really eye-opening experience. I don't have much of a math background, but have already found Lean lets me do things that I found really hard before.

**Marc Brooker** @marcbrooker

Loved this talk by Terence Tao on math, AI, and proof assistants. I suspect we're going to see a similar effect in software: AI will allow formal specs to be developed faster than traditional programs, and for programs to be generated from those specs. https://www.youtube.com/watch?v=_sTDSO74D8Q

**Marc Brooker** @marcbrooker

So much fun systems work here. I'm super proud of the team that shipped this product, and wrote this paper describing (a small part of) their innovation. Check out our paper here: https://www.amazon.science/publications/resource-management-in-aurora-serverless

**Marc Brooker** @marcbrooker

Next, we had to learn how to actively manage heat across the database fleet, in a way that ensured the right resources are available when customers need them. This work included placement, live migration, and instance-level controls.

**Marc Brooker** @marcbrooker

Next, we had to place databases across a large fleet in a way that allowed us to optimize for performance, predictability, and cost. Placement allows us to pick workloads that fit nicely together, such as by wanting to scale at different times.

**Marc Brooker** @marcbrooker

Teaching database engines how to scale up isn't too hard: they're hungry hippos and will eat all the memory and CPU you can give them. Teaching them to scale down was harder, and required us to really understand what drives working-set behavior at the engine and OS level.

**Marc Brooker** @marcbrooker

New blog post, on our paper about Aurora Serverless V2, "Resource Management in Aurora Serverless": https://brooker.co.za/blog/2024/07/29/aurora-serverless.html What I loved about this project was being able to innovate at all levels of the stack, from the hypervisor all the way to region-scale clusters.

**Marc Brooker** @marcbrooker

Let's consign CAP to the cabinet of curiosities: https://brooker.co.za/blog/2024/07/25/cap-again.html

If you’re an experienced distributed systems person teaching new folks about trade-offs in your space, please don’t start with CAP. There are tons of more interesting, more instructive trade-offs.

**Marc Brooker** @marcbrooker

I think the porosity on the head is caused by incomplete burnout, but I'm not sure about the generally rough surface texture (the investment I'm using should be able to do a ton better). Technically not "lost PLA", but "lost PVB" (Polymaker Polycast). Metal is ZA-12 (zinc aluminium alloy).

This is the first casting I've done since making die-cast tin soldiers with my friend's grandfather's stuff like 30 years ago.

**Marc Brooker** @marcbrooker

First attempt at "lost PLA" investment casting: psyduck.

**Marc Brooker** @marcbrooker

Here's a practical example of how Formal Methods can make code faster: AWS's new proven-correct authorization engine is 65% faster (even at p99.9) than the previous version. https://youtu.be/oshxAJGrwMU?si=3kjRoUIaa527u3EI&t=3188

Sean says in the talk "we wouldn't have attempted a lot of optimizations we did without this backing of proof".

Formal methods allow us to optimize systems more boldly, by removing the risk that optimizations will affect correctness.

**Marc Brooker** @marcbrooker

This is super cool: researchers at @UW using time domain reflectometry in existing telecom optical fibers to measure seismic activity at Mt Rainier: https://www.youtube.com/watch?v=3E5N9B2xpOU As far as I can tell, what they're looking at here is variations in Rayleigh scattering of light inside the fiber. Rayleigh scattering is proportional to the eighth power of refractive index, which makes this technique super sensitive to even small changes in the index.
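
Taking the post's eighth-power dependence at face value, and assuming every other factor in the scattering coefficient stays fixed (an assumption made here purely for illustration), the sensitivity can be sketched in one line. The symbols $\alpha_R$ (Rayleigh scattering coefficient) and $n$ (refractive index) are labels chosen for this sketch, not notation from the post:

$$
\alpha_R \propto n^{8}
\quad\Longrightarrow\quad
\frac{\mathrm{d}\alpha_R}{\alpha_R} = 8\,\frac{\mathrm{d}n}{n}
$$

Under that assumption, a 0.1% change in $n$ shifts the scattered power by roughly 0.8%, since $1.001^{8} \approx 1.008$.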

**Marc Brooker** @marcbrooker

Fascinating: the Cascades Volcano Observatory's seismic monitoring of Mt Rainier is sensitive enough to track avalanches: https://www.youtube.com/watch?v=HyTTSDXrPcY (but they're looking for something much bigger: lahars).

**Marc Brooker** @marcbrooker

This kind of research is interesting to me, as a practitioner, because correctness is one of the hardest things we do. Finding new ways to reason about the correctness of systems is super valuable to us - both in saving development time and in building better systems.

**Marc Brooker** @marcbrooker

The core idea behind their approach is rather simple: two "facts" about operations encoded in their types.

**Marc Brooker** @marcbrooker

I especially like how they've combined three approaches (their type-based approach, model checking in Alloy, and classic testing approaches) to find implementation bugs at multiple different levels. This is exactly the approach to verification and testing I recommend at AWS.

**Marc Brooker** @marcbrooker

Crash consistency matters for filesystems and databases because, without it, it's impossible for the FS or DB to provide guarantees that extend beyond unplanned system reboots. Unplanned reboots do happen, and so this is a critical real-world property.

**Marc Brooker** @marcbrooker

This SquirrelFS paper by Hayley LeBlanc et al is extremely cool. The short version is that they use the Typestate pattern (along with Rust's type checker) to check certain crash consistency properties of a filesystem *at compile time*. Check out their paper at https://arxiv.org/abs/2406.09649 (and I think they're at OSDI this week too).
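
For readers who haven't met the Typestate pattern, here is a minimal Rust sketch of the general idea. It is not SquirrelFS's code: `Record`, `Dirty`, `Flushed`, `flush`, `commit`, and `write_in_order` are hypothetical names invented for illustration, and the "flush" does no real I/O. The point is only that an out-of-order call fails to type-check rather than failing at runtime.

```rust
// Hypothetical typestate sketch; names are illustrative, not SquirrelFS's API.
use std::marker::PhantomData;

// Zero-sized marker types, one per persistence state.
pub struct Dirty;
pub struct Flushed;

/// A record whose persistence state is tracked in its type parameter.
pub struct Record<State> {
    data: Vec<u8>,
    _state: PhantomData<State>,
}

impl Record<Dirty> {
    /// New records always start out dirty (not yet durable).
    pub fn new(data: Vec<u8>) -> Self {
        Record { data, _state: PhantomData }
    }

    /// Consumes the dirty record and returns a flushed one.
    /// A real filesystem would issue its flush and fence operations here.
    pub fn flush(self) -> Record<Flushed> {
        Record { data: self.data, _state: PhantomData }
    }
}

impl Record<Flushed> {
    /// Only a flushed record can be committed; returns the byte count.
    pub fn commit(self) -> usize {
        self.data.len()
    }
}

/// Performs the two steps in the only order the types allow.
pub fn write_in_order(data: Vec<u8>) -> usize {
    let dirty = Record::<Dirty>::new(data);
    // dirty.commit(); // would not compile: `commit` is not defined for Record<Dirty>
    dirty.flush().commit()
}
```

Calling `write_in_order(vec![1, 2, 3])` from a `main` or a test exercises the happy path. Because `flush` takes `self` by value, the dirty handle cannot be reused after the transition, which is what lets the compiler enforce ordering.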

**Marc Brooker** @marcbrooker

New post on my misc blog, suggesting a way to make the Prusa XL a much better printer: https://brooker.co.za/misc-blog/2024/07/07/prusaxl.html

**Marc Brooker** @marcbrooker

A very nice way to spend an afternoon.

**Marc Brooker** @marcbrooker

New blog post, on the "you don't need distributed systems" meme, and the idea that distributed systems are only about scale: https://brooker.co.za/blog/2024/06/04/scale.html Short version: scale is only a tiny part of what makes distributed systems interesting and useful.

**Marc Brooker** @marcbrooker

Why is it considered totally normal to live places where heating is required to stay alive, but some kind of affront to nature to live places where cooling is required to stay alive?

**Marc Brooker** @marcbrooker

Ignore the clickbait title, this is fascinating. It goes to show how much we lose when we collapse the full spectrum of color down to a 3-vector.

https://www.youtube.com/watch?v=UQuIVsNzqDk

We make color a 3-vector (or a 4-vector) because that's how our color perception works. But it's not how color physically works. It throws out a vast amount of spectral information (not to mention the phase information).

**Marc Brooker** @marcbrooker

@ltratt@mastodon.social My belief on why UML never really succeeded is that it doesn't do this well. It's on the wrong semantic level to specify the known-knowns about behavior, and to explore the unknowns about structure or behavior. Too high level for one, too low for another.

**Marc Brooker** @marcbrooker

@ltratt@mastodon.social Good software practices, and mature teams, both take maximal advantage of the things they know to reduce iteration, and use iteration as a tool for discovering the things they don't know. They're curious, and expand the set of things they know from experience (theirs and others).

**Marc Brooker** @marcbrooker

@ltratt@mastodon.social What we should be embarrassed about is unneeded iteration: cases where we wastefully iterate towards goals that are well-known and can be clearly specified. This is a big part of my interest in tools like P and TLA+: going "direct to goal" for the well-understood problems.

**Marc Brooker** @marcbrooker

This is smart stuff from @ltratt@mastodon.social. I might go further and say that the ability to iterate at relatively low cost is one of the defining benefits of software, and not something we need to be embarrassed about: https://tratt.net/laurie/blog/2024/what_factors_explain_the_nature_of_software.html

**Marc Brooker** @marcbrooker

New blog post, about Nagle's algorithm and TCP_NODELAY: https://brooker.co.za/blog/2024/05/09/nagle.html

In the post, I look into the history of Nagle's algorithm, the interaction with delayed ack, and propose that it's not the right default for the modern internet.
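
As a concrete reference point, here is a minimal sketch of opting out of Nagle's algorithm on a single connection using Rust's standard library, where `TcpStream::set_nodelay(true)` sets the `TCP_NODELAY` socket option. The helpers `connect_nodelay` and `demo` are hypothetical names for this sketch, and the loopback listener exists only so the example has something to connect to; none of this comes from the blog post.

```rust
use std::io;
use std::net::{SocketAddr, TcpListener, TcpStream};

/// Connects to `addr` and disables Nagle's algorithm on the new socket,
/// so small writes are sent immediately instead of being coalesced.
pub fn connect_nodelay(addr: SocketAddr) -> io::Result<TcpStream> {
    let stream = TcpStream::connect(addr)?;
    stream.set_nodelay(true)?; // sets TCP_NODELAY on this socket
    Ok(stream)
}

/// Opens a throwaway loopback listener, connects to it with Nagle
/// disabled, and reports the socket's TCP_NODELAY setting.
pub fn demo() -> io::Result<bool> {
    // Port 0 asks the OS to pick any free port.
    let listener = TcpListener::bind("127.0.0.1:0")?;
    let stream = connect_nodelay(listener.local_addr()?)?;
    stream.nodelay()
}
```

The option is per-socket, so a server would need to set it on each accepted connection as well.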

Marc Brooker @marcbrooker@fediscience.org