frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Nextflow: System for creating scalable, portable, reproducible workflows

https://github.com/nextflow-io/nextflow
45•saikatsg•12h ago

Comments

totalperspectiv•8h ago
Cool seeing a workflow language pop up on HN!

Nextflow and Snakemake are the two most-used options in bioinformatics these days, with WDL trailing those two.

I really wish Nextflow was based on Scala and not Groovy, but so it goes.

There is a Draft up for dsl3 that adds static types to the channels that I’m very excited about. https://github.com/nf-core/fetchngs/pull/309

_Wintermute•7h ago
The choice of groovy was unfortunate, but yet it still seems more popular than snakemake which I can only attribute to the nf-core set of curated workflows.

I have a dislike of nextflow because it submits 10s of thousands of separate jobs to our HPC scheduler which causes a number of issues, though they've now added support for array jobs which should hopefully solve that.

samuell•4h ago
To implement an efficient dataflow-based programming API/DSL, you better have some support for channels and lightweight threads in a scriptable language, something that you've got in Groovy with the GPars library that Nextflow uses.

We opted for implementing all of this in Go in SciPipe, where we get similar basic dataflow/flow-based functionality as Nextflow with the native concurrency primitives of Go, but the Go syntax probably/surely puts away some biologists who have written some python at most before, and Go won't let us customize the API and hide away as much of the plumbing under nice syntax, as Groovy.

In this regard, Groovy with the GPars library for the concurrency, doesn't seem as a particularly bad choice. There weren't that many options at the time either.

The downside has been tooling support though, such as editor intelligence and debugging support, although parts of that is finally improving now with a NF language server.

Today, one could probably implement something similar with Python's asyncio and queues for the channel semantics, and there is even the Crystal language that has Go-like concurrency in a much more script-like language (see a comparison between Go and Crystal concurrency syntax at [1]), but Crystal would of course be an even more fringe langauge than Groovy.

[1] https://livesys.se/posts/crystal-concurrency-easier-syntax-t...

totalperspectiv•3h ago
I really wish Crystal had taken off a bit. I thought it had a chance in bfx with some good benchmarking and PR by lh3 in biofast.
armedgorilla•7h ago
At a previous Biotech, we used Cromwell/WDL because the DSL was the most intuitive to our bioinformatics scientists. But seeing as that doesn't work as nicely on AWS (and is also supported by an organization that is imploding), we opted for Argo on our K8s cluster to process RNAseq data en masse. Getting the scientists to use YAMl has been an uphill struggle, but the same issues would apply to learning groovy I guess. We've found that the Argo engine is easier to maintain, and also we only have to support one orchestrator across our Bioinformatics and ML teams.

For industrial purposes, I've started to approach these pipelines as a special case of feature extraction and so I'm reusing our ML infrastructure as much as possible.

totalperspectiv•6h ago
I would rather write Groovy than YAML any day of the week.

Why did you rule out Nextflow or Snakemake? I believe they both work with k8 clusters.

Argo doesn’t look great from my standpoint as a workflow author.

azan_•6h ago
I've used Snakemake my whole life, can someone experienced with both systems share whether jumping to nextflow is worth it?
totalperspectiv•6h ago
NF Tower / Seqera would be the selling points. They offer a nice UX for managing pipelines and abstract over AWS.

Technically snakemake can do it all. But in practice NF seems to scale up a bit better.

That said, if you don’t need the UI for scientists, I’d stick to snakemake.

Protostome•5h ago
I have pipelines written in both frameworks. Nextflow (despite the questionable selection of groovy as the language of choice) is more powerful and enables greater flexibility in terms of information flow.

For example, snakemake makes it very difficult if not impossible to create pipelines that deviate from a DAG architecture. In cases where you need loops, conditionals and so on, Nextflow is a better option.

One thing that I didn't like about nextflow is that all processes can either run under apptainer or docker, you can mix and match docker/apptainer like you do in snakemake rules.

chrisweekly•4h ago
"you can mix and match"

you meant "CAN'T", right?

Protostome•1h ago
yep :)
biophysboy•2h ago
I’ve used both. I would say nextflow is a more production-oriented tool. Check out seqera platform to see if any of the features there seem useful. It can also be useful to get out of the wildcards/files mindset for certain workflows. Nextflow chucks the results of a step into a hashed folder, so you don’t have to worry about unique output names.

That said, I do find snakemake easier to prototype with. And it also has plenty of production features (containers, cloud, etc). For many use cases, they’re functionally equivalent

christopher8827•3h ago
ayyy, they used this in one of my previous workpaces in biotech.

Altermagnets: The first new type of magnet in nearly a century

https://www.newscientist.com/article/2487013-weve-discovered-a-new-kind-of-magnetism-what-can-we-do-with-it/
105•Brajeshwar•2h ago•27 comments

KDB-X: KX releases FREE Commercial KDB license

https://www.defconq.tech/blog/From%20Elite%20to%20Everyone%20-%20KX%20Community%20Edition%20Breaks%20Loose
11•AUnterrainer•48m ago•2 comments

Show HN: Improving search ranking with chess Elo scores

https://www.zeroentropy.dev/blog/improving-rag-with-elo-scores
85•ghita_•3h ago•20 comments

Chain of thought monitorability: A new and fragile opportunity for AI safety

https://arxiv.org/abs/2507.11473
39•mfiguiere•3h ago•10 comments

Pgactive: Postgres active-active replication extension

https://github.com/aws/pgactive
188•ForHackernews•8h ago•58 comments

Mkosi – Build Bespoke OS Images

https://mkosi.systemd.io/
21•leetrout•1h ago•3 comments

How and where will agents ship software?

https://www.instantdb.com/essays/agents
5•stopachka•11m ago•0 comments

Weave (YC W25) is hiring a founding AI engineer

https://www.ycombinator.com/companies/weave-3/jobs/SqFnIFE-founding-ai-engineer
1•adchurch•57m ago

Shipping WebGPU on Windows in Firefox 141

https://mozillagfx.wordpress.com/2025/07/15/shipping-webgpu-on-windows-in-firefox-141/
278•Bogdanp•11h ago•109 comments

Cloudflare 1.1.1.1 Incident on July 14, 2025

https://blog.cloudflare.com/cloudflare-1-1-1-1-incident-on-july-14-2025/
446•nomaxx117•14h ago•298 comments

I'm switching to Python and actually liking it

https://www.cesarsotovalero.net/blog/i-am-switching-to-python-and-actually-liking-it.html
172•cesarsotovalero•10h ago•263 comments

How I lost my backpack with passports and laptop

https://psychotechnology.substack.com/p/how-i-lost-my-backpack-with-passports
45•eatitraw•1d ago•37 comments

Mill: A better build tool for Java, Scala, and Kotlin

https://mill-build.org/mill/index.html
35•lihaoyi•2h ago•10 comments

What's happening to reading?

https://www.newyorker.com/culture/open-questions/whats-happening-to-reading
48•Kaibeezy•3d ago•112 comments

'Gentle Parenting' My Smartphone Addiction

https://www.newyorker.com/culture/infinite-scroll/gentle-parenting-my-smartphone-addiction
16•fortran77•2h ago•15 comments

Pascal's Scams (2012)

http://unenumerated.blogspot.com/2012/07/pascals-scams.html
52•walterbell•4d ago•39 comments

Tilck: A Tiny Linux-Compatible Kernel

https://github.com/vvaltchev/tilck
233•chubot•14h ago•44 comments

Thunderbird: Fluent Windows 11 Design

https://github.com/Deathbyteacup/fluentbird
187•skipnup•3d ago•98 comments

Show HN: An MCP server that gives LLMs temporal awareness and time calculation

https://github.com/jlumbroso/passage-of-time-mcp
44•lumbroso•3h ago•28 comments

Atopile – Design circuit boards with code

https://atopile.io/atopile/introduction
39•poly2it•3d ago•11 comments

MARS.EXE → COM (2021)

https://chaos.if.uj.edu.pl/~wojtek/MARS.COM/
115•reconnecting•4d ago•33 comments

Ukrainian hackers destroyed the IT infrastructure of Russian drone manufacturer

https://prm.ua/en/ukrainian-hackers-destroyed-the-it-infrastructure-of-a-russian-drone-manufacturer-what-is-known/
481•doener•9h ago•325 comments

Show HN: BloomSearch – Keyword search with hierarchical bloom filters

https://github.com/danthegoodman1/bloomsearch
17•dangoodmanUT•3d ago•3 comments

GPUHammer: Rowhammer attacks on GPU memories are practical

https://gpuhammer.com/
237•jonbaer•17h ago•80 comments

In-depth system walkthrough: cloud-based VOD

https://app.ilograph.com/demo.ilograph.AWS%2520Video-On-Demand/Workflow
5•billyp-rva•3d ago•0 comments

LLM Daydreaming

https://gwern.net/ai-daydreaming
147•nanfinitum•15h ago•113 comments

Show HN: DataRamen, a Fast SQL Explorer with Automatic Joins and Data Navigation

https://dataramen.xyz/
36•oleksandr_dem•6h ago•37 comments

Show HN: Shoggoth Mini – A soft tentacle robot powered by GPT-4o and RL

https://www.matthieulc.com/posts/shoggoth-mini
551•cataPhil•1d ago•102 comments

cppyy: Automatic Python-C++ Bindings

https://cppyy.readthedocs.io/en/latest/
60•gjvc•5h ago•15 comments

Reflections on OpenAI

https://calv.info/openai-reflections
667•calvinfo•1d ago•350 comments