frontpage.

Open Ports: lsof doesn't always give you what you're looking for

https://theexceptioncatcher.com/2025/09/finding-the-owner-of-an-open-port/
1•monksy•19s ago•0 comments

Show HN: JsonPost – Universal Back End for Static Website Owners

https://jsonpost.com
1•ubergeekady•1m ago•0 comments

Musings on Generative AI

https://leejo.github.io/2025/09/23/gaps/
1•leejo•3m ago•0 comments

From Goldman to AI: How Rishi Bali Is Building Wall St's AI Transformation Layer

https://www.forbes.com/sites/sindhyavalloppillil/2025/09/23/from-goldman-to-ai-how-rishi-bali-is-...
1•sindhya1•7m ago•0 comments

GitOps without Kubernetes: Declarative, Git Docker deploy with simplecontainer

https://blog.simplecontainer.io/gitops-without-kubernetes-declarative-git-driven-docker-deploymen...
2•qdnqn•9m ago•0 comments

OxygenOS/Android bug exposes SMS/MMS

https://www.theregister.com/2025/09/23/rapid7_oneplus_android_bug/
2•rippeltippel•10m ago•0 comments

The Beginning of My Programming Journey

https://docs.google.com/document/d/e/2PACX-1vS6wnp3kbNrjT9TEfHVpEqbAITqH_G6QubjczV8KRXFmOQ1JSTt1o...
1•surprisetalk•10m ago•0 comments

When Your Disney Playlist Saves Your Tech Career [video]

https://www.youtube.com/watch?v=rLJattwAceU
1•surprisetalk•11m ago•0 comments

Apple has trained its AI to respond to Trump's nonsense: report

https://mobilesyrup.com/2025/09/11/apple-ai-model-update-trump-response/
8•billybuckwheat•14m ago•0 comments

What Happened to Freshcode.club?

https://freshcode.club/
1•lucas_membrane•16m ago•1 comments

The Illustrated Evo 2

https://research.nvidia.com/labs/dbr/blog/illustrated-evo2/
1•jwilber•16m ago•0 comments

Alphabet's Letter to the House Judiciary Committee [pdf]

https://judiciary.house.gov/sites/evo-subsites/republicans-judiciary.house.gov/files/evo-media-do...
3•ImJamal•18m ago•0 comments

Privacy Commissioners find TikTok collected sensitive data from children

https://www.priv.gc.ca/en/opc-news/news-and-announcements/2025/nr-c_250923/
3•Improvement•18m ago•0 comments

Nuklear: A minimal-state, immediate-mode graphical user interface toolkit

https://github.com/Immediate-Mode-UI/Nuklear
2•jbm•19m ago•0 comments

Persistent sequences with insert and delete and canonical structure?

https://cs.stackexchange.com/questions/173598/persistent-sequences-with-insert-and-delete-and-can...
1•practal•20m ago•1 comments

Show HN: Inflow – invoke an LLM with your viewport just by typing

https://github.com/cboRD181/Inflow
1•vagabund•20m ago•0 comments

Rapid epigenomic classification of acute leukemia

https://www.nature.com/articles/s41588-025-02321-z
1•bookofjoe•21m ago•0 comments

Most Canadians think Trump would break any new trade deal

https://nationalpost.com/news/canada/candians-trump-trade-deal-poll
5•bbarnett•22m ago•5 comments

JEP Draft: Lazy Constants (Second Preview)

https://openjdk.org/jeps/8359894
1•za3faran•23m ago•0 comments

Is Fortran better than Python for teaching basics of numerical linear algebra?

https://loiseaujc.github.io/posts/blog-title/fortran_vs_python.html
3•Bostonian•26m ago•2 comments

Show HN: Shaders – A first-of-its-kind component library for front end magic

https://twitter.com/npm_i_shaders/status/1970526500811350397
1•marchantweb•26m ago•0 comments

GNOME 49 adds a "Support GNOME" button, alongside a reminder twice a year

https://old.reddit.com/r/gnome/comments/1mpdmgx/gnome_49_adds_a_support_gnome_button_to_the_about/
1•perihelions•27m ago•0 comments

Why non-deterministic AI agents are the ultimate doom for enterprises

https://blog.inxm.ai/p/why-non-deterministic-ai-agents-are
1•oesimania•28m ago•1 comments

The Complete Guide to Dev Containers in Ruby on Rails

https://www.rorvswild.com/blog/2025/dev-containers-rails
1•hahahacorn•31m ago•0 comments

Request HN [Meta]: Noticed a bug with renamed submissions

2•add-sub-mul-div•35m ago•0 comments

A Novel Technique for SQL Injection in PDO's Prepared Statements

https://slcyber.io/assetnote-security-research-center/a-novel-technique-for-sql-injection-in-pdos...
1•donatj•36m ago•0 comments

Gemini Live API

https://twitter.com/GoogleAIStudio/status/1970545734736023564
2•simianwords•37m ago•0 comments

Denmark links drones at Copenhagen airport to hybrid attacks across Europe

https://www.reuters.com/world/europe/drones-that-shut-copenhagen-airport-flown-by-capable-operato...
4•JumpCrisscross•37m ago•0 comments

For the Founders: What are your launch strategies?

2•Verdierm•38m ago•3 comments

Something Is Very Wrong Online

https://www.theatlantic.com/technology/archive/2025/09/charlie-kirk-assassination-online-reaction...
1•byproxy•39m ago•1 comments

Show HN: SyGra – Graph-oriented Synthetic data generation Pipeline for LLMs

https://github.com/ServiceNow/SyGra
1•zephyrzilla•1h ago
We're open-sourcing SyGra, a framework for building reproducible synthetic-data pipelines for LLM training and evaluation (SFT, DPO, agent simulation, multimodal).

Problem:

High-quality datasets are scarce, expensive, and often sensitive. When teams turn to synthetic data, the hard part isn't writing individual prompts but building the end-to-end system: designing branching and looping workflows; coordinating multiple inference backends, APIs, and tool calls; enforcing validation, schema compliance, and quality tagging at scale; and running fault-tolerant jobs with resumability, sharding, and streaming. Ad-hoc notebooks and scripts don't capture that lifecycle.

What SyGra is:

A graph-oriented framework where you define nodes (LLM calls, samplers, transforms, agents, subgraphs) and edges (conditional / parallel / loops). Author pipelines in low-code YAML (CLI-runnable) or compose in Python. Emphasis on structured outputs and reproducibility.
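
To make the node/edge idea concrete, here is a minimal sketch in plain Python of a graph with a conditional edge and a loop. It is not SyGra's API: `Graph`, `add_node`, `add_edge`, and the `generate`/`validate` nodes are hypothetical names used only for illustration, and the "LLM call" is a stub.

```python
from typing import Callable, Dict, Optional

State = Dict[str, object]
NodeFn = Callable[[State], State]
# A router inspects the state and returns the next node name, or None to stop.
Router = Callable[[State], Optional[str]]


class Graph:
    """Hypothetical toy graph runner; not SyGra's actual classes."""

    def __init__(self) -> None:
        self.nodes: Dict[str, NodeFn] = {}
        self.routers: Dict[str, Router] = {}

    def add_node(self, name: str, fn: NodeFn) -> None:
        self.nodes[name] = fn

    def add_edge(self, src: str, router: Router) -> None:
        self.routers[src] = router

    def run(self, start: str, state: State, max_steps: int = 20) -> State:
        current: Optional[str] = start
        steps = 0
        # max_steps guards against a loop that never satisfies its exit condition.
        while current is not None and steps < max_steps:
            state = self.nodes[current](state)
            router = self.routers.get(current)
            current = router(state) if router else None
            steps += 1
        return state


def generate(state: State) -> State:
    # Stand-in for an LLM call: each attempt produces a longer draft.
    attempts = int(state.get("attempts", 0)) + 1
    return {**state, "attempts": attempts, "draft": "word " * attempts}


def validate(state: State) -> State:
    # Stand-in for schema/quality validation: require at least three words.
    return {**state, "ok": len(str(state["draft"]).split()) >= 3}


graph = Graph()
graph.add_node("generate", generate)
graph.add_node("validate", validate)
graph.add_edge("generate", lambda s: "validate")
# Conditional edge: loop back to "generate" until validation passes.
graph.add_edge("validate", lambda s: None if s["ok"] else "generate")

result = graph.run("generate", {})
print(result["attempts"], result["ok"])
```

The router-per-node design keeps conditional edges and loops in one mechanism: a loop is just a conditional edge that points backwards.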

Key capabilities:

- Graph model: reusable subgraphs; conditional/parallel edges; loops

- Quality: dual-stage quality tagging (heuristics + LLM-based scoring); OASST-style conversation formatting

- Backends: vLLM, Hugging Face TGI, Azure OpenAI, Ollama (Triton-compatible)

- Data I/O: Hugging Face datasets (read/write, streaming) + local files; schema + metadata tracking

- Execution: async runtime; checkpointing/resume; sharding support; multimodal inputs (image/audio/text); agent/tool nodes via LangGraph

- Reproducibility: deterministic configs, seeds, artifact paths, and provenance logs

- Modes: CLI (execute YAML graphs) or Python APIs (embed in notebooks/apps)

- License: Apache-2.0
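
As a rough illustration of two items in the list above (dual-stage quality tagging and checkpointing/resume), here is a small standalone sketch. It does not use SyGra; the function names, the three-word heuristic, the unique-word-ratio "score" standing in for an LLM-based scorer, and the JSONL checkpoint layout are all assumptions made for the example.

```python
import json
import os
import tempfile
from typing import Dict, Iterable, Set


def heuristic_ok(text: str) -> bool:
    # Stage 1: cheap rule-based filter (hypothetical rule: at least 3 words).
    return len(text.split()) >= 3


def model_score(text: str) -> float:
    # Stage 2: stand-in for an LLM-based scorer; here, the unique-word ratio.
    words = text.split()
    return len(set(words)) / len(words)


def tag(record: Dict) -> Dict:
    # Only records that pass the heuristic reach the (expensive) second stage.
    passed = heuristic_ok(record["text"])
    score = model_score(record["text"]) if passed else 0.0
    return {**record, "heuristic_ok": passed, "score": score}


def load_done_ids(path: str) -> Set[int]:
    # Read ids already written to the checkpoint file, if it exists.
    if not os.path.exists(path):
        return set()
    with open(path, encoding="utf-8") as f:
        return {json.loads(line)["id"] for line in f if line.strip()}


def run(records: Iterable[Dict], checkpoint_path: str) -> int:
    # Append one JSON line per tagged record; skip ids seen in a previous run.
    done = load_done_ids(checkpoint_path)
    processed = 0
    with open(checkpoint_path, "a", encoding="utf-8") as f:
        for record in records:
            if record["id"] in done:
                continue
            f.write(json.dumps(tag(record)) + "\n")
            f.flush()
            processed += 1
    return processed


records = [
    {"id": 1, "text": "too short"},
    {"id": 2, "text": "the quick brown fox"},
    {"id": 3, "text": "spam spam spam spam"},
]
checkpoint = os.path.join(tempfile.mkdtemp(), "tagged.jsonl")
first = run(records[:2], checkpoint)  # simulate a job interrupted after two records
second = run(records, checkpoint)     # rerun over everything: only id 3 is new
print(first, second)
```

Writing one line per record and flushing after each write means an interrupted run loses at most the record in flight, and a rerun only pays for what is missing.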

Links:

- Repo & README: https://github.com/ServiceNow/SyGra

- PyPI: https://pypi.org/project/sygra/

- Paper (design rationale): https://arxiv.org/abs/2508.15432

Disclosure: I'm part of the team behind SyGra.