
New technique to easily partition CSV files for parallel processing

https://github.com/medialab/xan/blob/master/docs/blog/csv_base_jumping.md
9•Yomguithereal•1h ago

Comments

Someone•1h ago
FTA: “Now let's come back to our jumping thought experiment: the issue here is that, if you jump to a random byte of a CSV file, you cannot know whether you landed in a quoted cell or not. So, if you read ahead and find a line break, is it delineating a CSV row, or is just allowed here because we stand in a quoted cell? And if you find a double quote? Are you opening a quoted cell or are you closing one?

[…]

Real-life CSV data is usually consistent. What I mean is that tabular data often has a fixed number of columns. Indeed, rows suddenly demonstrating an inconsistent number of columns are typically frowned upon. What's more, columns often hold homogeneous data types: integers, floating point numbers, raw text, dates etc. Finally, rows tend to have a comparable size in number of bytes. We would be fools not to leverage this consistency.

So now, before doing any reckless jumping, let's start by analyzing the beginning of our CSV file to record some statistics that will be useful down the line.

[…]

Anyway, we now have what we need to be able to jump safely”

‘Safely’. An attacker who has control over a row in that file can easily embed data that satisfies the statistical checks, thus injecting data.

The author also admits that, saying “This technique is reasonably robust and will let you jump safely”

I agree with “reasonably robust”, but not with “will let you jump safely”.
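
To make the quoted idea and the objection concrete, here is a minimal Python sketch. It is not the xan implementation: the function names, the use of column count as the only statistic, and the `check_rows` and `max_candidates` parameters are all assumptions made for illustration.

```python
import csv
import io


def sample_column_count(data: bytes, sample_rows: int = 10) -> int:
    """Parse the first rows from byte 0 (where quoting state is known)
    and return the most common column count."""
    reader = csv.reader(io.StringIO(data.decode("utf-8"), newline=""))
    counts = [len(row) for _, row in zip(range(sample_rows), reader)]
    return max(set(counts), key=counts.count)


def find_row_start(data, offset, expected_cols, check_rows=3, max_candidates=32):
    """Guess the byte offset of a row start after `offset`.

    Each byte following a line break is a candidate. A candidate is accepted
    when the next `check_rows` rows parsed from it all have `expected_cols`
    columns. This is a heuristic, not a proof: it can be fooled.
    """
    pos = offset
    for _ in range(max_candidates):
        newline = data.find(b"\n", pos)
        if newline == -1:
            return None  # no further line break, so no candidate left
        candidate = newline + 1
        # Decoding the whole tail is wasteful; a real tool would read a window.
        tail = data[candidate:].decode("utf-8", errors="replace")
        reader = csv.reader(io.StringIO(tail, newline=""))
        rows = [row for _, row in zip(range(check_rows), reader)]
        if rows and all(len(row) == expected_cols for row in rows):
            return candidate
        pos = candidate  # rejected: move on to the next line break
    return None
```

With this sketch, a quoted cell whose continuation line reads `x,y,z"` passes the check in a three-column file, so the returned offset can land inside the cell. That is the injection scenario described above, and a richer set of statistics narrows it without closing it.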

starlita•1h ago
"robust" in the same sentence as "CSV" makes me laugh anyway ;)

Yomguithereal•57m ago
> ‘Safely’. An attacker who has control over a row in that file can easily embed data that satisfies the statistical checks, thus injecting data.

This is clearly not the sort of thing you should expose to anyone; it is an optimization technique. In the same way, you would not use a fast but DoS-able hash function for your hashmap.

gazoduke•1h ago
If I remember correctly, CSV.jl also has something of the kind: https://csv.juliadata.org/stable/reading.html#CSV.Chunks

The statistics used are a bit different, though.

topita•1h ago
Isn't this more robust, though? I feel like using lines to detect the next rows is very flimsy. I usually deal with CSV files containing full press articles, and I am quite sure the CSV.Chunks method would fail without the correct hyperparameter. This method seems more, I dunno, "adaptive".
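
A small Python illustration of why counting lines is flimsy for this kind of data. It only shows the general mismatch between physical lines and CSV rows, using made-up sample text, and says nothing about how CSV.jl actually behaves.

```python
import csv
import io

# Hypothetical sample: the first article body spans two physical lines.
text = (
    "id,body\n"
    '1,"First paragraph.\nSecond paragraph."\n'
    '2,"Short article."\n'
)

# Naive approach: every line break is assumed to end a row.
naive_rows = text.splitlines()

# Quote-aware approach: the csv module tracks whether it is inside a quoted cell.
parsed_rows = list(csv.reader(io.StringIO(text, newline="")))

print(len(naive_rows), len(parsed_rows))
```

The naive split sees four "rows" where the parser sees three, because the line break inside the quoted article body is data, not a row delimiter.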