frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: What do SRE do at your company?

3•petemc_•3h ago
The SRE role seems to mean wildly different things depending on who you ask.

Comments

decatur•2h ago
Produce hot air, check boxes, and 'see, I told you so'
VirusNewbie•2h ago
I'm a SWE SRE at Google. That means we had to do a SWE interview with an emphasis on system design.

So I'm expected to be able to do both operations for oncall, but also do RCA and implement fixes and changes to make the systems our team is responsible for more reliable.

We're able to throttle the release cadence of binaries, so we work together with dev teams (SWEs who develop features) to come up with appropriate monitoring, metrics, mitigation, and scaling capabilities.

Some SREs are not SWE SREs, they usually have a specialty related to the team they're on, such as networking, low level linux internals, etc. They're still expected to be able to write production level python/Go code.

They are more likely to send a bug to the devs rather than fix it themselves, where as I will often (but not always) just go right in and send a CL to the devs fixing or optimizing something.

petemc_•1h ago
Hey VirusNewbie, thank you for your response! A few more questions if you don't mind.

Did you read the SRE handbook before applying?

How do you decide who gets alerts (or are devs never on call)?

coldfloor•1h ago
I was an SRE at Yahoo until around the end of 2024. Not sure if things have changed - last I heard my former team had been laid off - but when I was there it was pretty easy. We had three tiers in the org, with increasing specificity and expertise: Operations Center -> SRE -> Product Engineers.

The OC collectively monitored everything across the company. Each alert that paged had an associated runbook. If they couldn't clear the alert with the runbook, they'd escalate to the SRE responsible for the alerting server/component. Our job was essentially to fix anything that broke that OC couldn't solve. For my domain this often just came down to basic Linux troubleshooting, but sometimes would actually involve specific knowledge about our component. For others (e.g. networking) I imagine the ratio of domain-specific-knowledge problems was higher.

If we determined something was fundamentally broken, like someone pushed an update and now the service won't start, we'd escalate that to PE. PE did a lot of what I think falls under SRE purview at other places: Managing deployments, building out infrastructure, etc. At Yahoo we were really just "tier 2 ops."

We'd also be paged for outages if our service went down or another team was blaming our service for their outage. The job here was essentially the same thing, just with more pressure and people yelling at you; or arguing and trying to prove your stuff was working, please find someone else to blame. If we were involved in an outage, we'd also have to join the "post mortem" (I'll never be able to say that without air quotes) and help with RCA/take on remediation tasks.

Secondarily, we created the monitoring/alerts that went to OC and wrote and maintained their runbooks. In our downtime we were also supposed to do simple automation/scripting to help us or OC with redundant tasks. Sometimes I think I made useful stuff, but often this felt like self-imposed busy work, because we always - especially under Marissa's stack ranking regime - had to demonstrate that we were doing more than just our job. I swear one quarter between us and OC we ended up with like 10 redundant Slack bots because everyone was rushing to make something to pad their review with.

natyoung•15m ago
Call APIs that 3rd party vendors provide. Talk about AI, because AI. Be silo.

Ask HN: Is WordPress the best way to create new websites for beginner

12•anitroves•5h ago•33 comments

Ask HN: Books about Genetic Algorithms

12•andyjohnson0•2h ago•2 comments

Ask HN: What do SRE do at your company?

3•petemc_•3h ago•5 comments

Ask HN: Is there a bad employers (who have a records of not paying) list?

51•trowa159•9h ago•60 comments

Ask HN: Where is the programming profession going?

152•syntaxbush•3d ago•166 comments

The open source DOCX editor submitted to HN a few weeks ago has been deleted

101•gcanyon•2d ago•44 comments

Ask HN: Is "no source code was copied" still a sufficient copyright defense?

64•oscgam1•2d ago•79 comments

Everyone feared AI taking over; the real danger is AI serving just the few

104•PhilipDaineko•1d ago•69 comments

Ask HN: Smallest amount of working ML weights that can be tattooed on a body?

7•thoughtpeddler•1d ago•4 comments

Ask HN: MacBook vs. Dedicated GPU for LLM

33•mzubairtahir•1d ago•65 comments

Ask HN: What do you predict the world will look like in 5-10 years?

9•justanything•1d ago•11 comments

I patched llama.cpp to gain 20% prompt processing TPS. Help me make a PR

5•i_am_rocoe•1d ago•2 comments

Ask HN: How much coding should beginners learn in the AI era?

36•JohnDSDev•4d ago•50 comments

Ask HN: What GUI/desktop app do you use to keep track of different AI sessions?

4•howToTestFE•1d ago•4 comments

Fast feedback loops is the way

5•skyglider•1d ago•0 comments

Recursive self improvement for human skills

4•rando77•1d ago•2 comments

Ask HN: Has Ilya Sutskever spoken publicly lately?

9•aurenvale•1d ago•1 comments

Tell Zillow: Fee-Simple vs. Leasehold Filter

4•HoldOnAMinute•2d ago•1 comments

Ask HN: Norway bans AI in elementary schools

15•mellosty•3d ago•19 comments

Data Privacy while using API tools

4•11shyam11•1d ago•4 comments

Tell HN: Mojo is becoming open source

8•theanonymousone•1d ago•4 comments

Ask HN: Is there a quiet market for 'no enforced AI' dev jobs?

7•reinhardt•1d ago•10 comments

Ask HN: Techniques for learning things quickly using coding agents?

5•throwaw12•2d ago•2 comments

Ask HN: You have one year to make $1M. What's your plan?

14•vantareed•10h ago•17 comments

Roblox parental controls are a dystopian security disaster

23•notsure357•2d ago•5 comments

Ask HN: Who remembers Fry's Electronics – the "church" of IT people?

8•netfortius•2d ago•4 comments

Ask HN: What home printer do you use/recommend?

20•niyazpk•5d ago•23 comments

Ask HN: Running local LLMs? What's your model and hardware

11•alfiedotwtf•1d ago•9 comments

I feel like VSCode is falling apart

16•othmanosx•3d ago•18 comments

Ask HN: Why does every AI demo sound perfect but real world deployment always

8•VaderAi•2d ago•12 comments