InfoSeek: The First Open-Source Framework for Deep Research Data Synthesis

1•BAAIBeijing•2h ago

  - The First Open-Source Dataset Purpose-Built for Deep Research Tasks
    - InfoSeek is the industry’s first dataset systematically designed for Deep Research tasks. It goes beyond traditional QA and multi-hop QA by focusing on complex, hierarchical Deep Research problems, filling a critical gap in high-quality training data.
  - End-to-End Open Source: Dataset + Data Synthesis Framework
    - Both the dataset and its generation framework are fully open source, so researchers can freely extend and adapt them.
    - Using tree-structured generation and backtracking verification, InfoSeek automatically synthesizes complex, multi-level questions while checking their correctness.
  - 50,000+ High-Quality, Multi-Step Reasoning Samples
    - The dataset contains over 50,000 high-quality samples, each requiring 4–6 reasoning steps on average.
    - Even an advanced model such as Qwen2.5-72B with chain-of-thought (CoT) prompting still fails 91.6% of the time on the test set, which highlights the difficulty and rigor of InfoSeek.
  - Resource Links
    - https://huggingface.co/datasets/Lk123/InfoSeek
    - https://github.com/VectorSpaceLab/InfoSeek
    - https://arxiv.org/abs/2509.00375

Comments

zephyrfalcon•1h ago
THIS InfoSeek? https://en.wikipedia.org/wiki/Infoseek Probably not...

Political Violence Makes No Sense

https://avi-loeb.medium.com/political-violence-makes-no-sense-cee20addd441
1•BruceEel•30s ago•0 comments

Apple Photos App Corrupts Images

https://tenderlovemaking.com/2025/09/17/apple-photos-app-corrupts-images/
2•pattyj•2m ago•0 comments

Safe Chain: Stopping Malicious NPM Packages Before They Wreck Your Project

https://www.aikido.dev/blog/introducing-safe-chain
1•danfritz•3m ago•0 comments

Ask HN/PG: Is the average screen resolution good enough for serif fonts yet?

1•Y_Y•6m ago•0 comments

Naming Software Teams

https://staysaasy.com/management/2025/07/06/team-names.html
1•thisismytest•7m ago•0 comments

Math Resource – "Hard Math for Elementary School"

https://kidswholovemath.substack.com/p/math-resource-hard-math-for-elementary
1•sebg•7m ago•0 comments

Lawyers challenge widespread police use of number-plate software as evidence

https://www.rnz.co.nz/news/national/573275/lawyers-challenge-widespread-police-use-of-number-plat...
1•Improvement•8m ago•0 comments

ChatGPT may soon require ID verification from adults

https://arstechnica.com/ai/2025/09/chatgpt-may-soon-require-id-verification-from-adults-ceo-says/
1•SlackingOff123•14m ago•0 comments

Build ClickHouse-Powered APIs with React and MooseStack

https://clickhouse.com/blog/clickhouse-powered-apis-in-react-app-moosestack
1•Liriel•19m ago•0 comments

Luxury Home and Commercial Remodels [video]

https://www.youtube.com/watch?v=PuvcxNJG9Ks
1•laraavino•20m ago•0 comments

Tech companies measure the impact of AI on software development

https://newsletter.pragmaticengineer.com/p/how-tech-companies-measure-the-impact-of-ai
1•kiyanwang•24m ago•0 comments

Writing a C Compiler, in Zig

https://asibahi.github.io/thoughts/c-compiler-1-zig/
3•ibobev•26m ago•0 comments

Designing Trust at Scale in Travel: What Moved Retention for Wego

1•emmanol•28m ago•0 comments

How people are using ChatGPT

https://openai.com/index/how-people-are-using-chatgpt/
1•cebert•28m ago•0 comments

Cuprum2929 Provides a Better Learning Experience Than AI

https://www.vaslabs.io/post/how-cuprum2929-provides-a-better-learning-experience-than-ai
1•vaslabsltd•28m ago•1 comment

The popular esbuild package (70M weekly downloads) has 0 dependencies

https://www.npmjs.com/package/esbuild?activeTab=dependencies
2•AbuAssar•30m ago•0 comments

Making Postgres scale to zero with CNPG

https://xata.io/blog/making-postgres-scale-to-zero-with-cnpg
2•gulcin•31m ago•0 comments

Ask HN: Why is GPT-5 almost the same as GPT-4?

1•whyandgrowth•32m ago•0 comments

"China keeps the algorithm": Critics attack Trump's TikTok deal

https://arstechnica.com/tech-policy/2025/09/china-keeps-the-algorithm-critics-attack-trumps-tikto...
4•pseudolus•37m ago•0 comments

Social media is a gnarly coordination problem we need to decompose

https://awarm.leaflet.pub/3lyzchme2d22b
2•jpereira•38m ago•0 comments

Kuo: OLED MacBook Pro to Feature Touch Screen Display

https://www.macrumors.com/2025/09/17/kuo-2026-oled-macbook-pro-touch-panel/
1•tosh•40m ago•0 comments

Simple-Datatables

https://github.com/fiduswriter/simple-datatables
1•palmfacehn•40m ago•0 comments

How to resolve common compatibility issues with ODF files

https://blog.documentfoundation.org/blog/2025/09/12/how-to-resolve-odf-compatibility-issues/
1•PaulHoule•43m ago•0 comments

Determination of the fifth Busy Beaver value

https://arxiv.org/abs/2509.12337
3•marvinborner•44m ago•0 comments

Erlang OTP 28.1 Released

https://www.erlang.org/patches/otp-28.1
2•sofetch•45m ago•0 comments

Scanoss GitHub Actions Adds Dependency Track Integration

1•scanosss•48m ago•0 comments

1969: Computer Banking and the End of Cash – BBC Archive [video]

https://www.youtube.com/watch?v=QiuIQNa-0-0
2•mon_•51m ago•0 comments

The Fink Project

https://www.finkproject.org/index.php
2•BruceEel•53m ago•0 comments

PureVPN IPv6 Leak

https://anagogistis.com/posts/purevpn-ipv6-leak/
3•todsacerdoti•1h ago•0 comments

Bob is always on lunch break

https://always-on-lunchbreak.lovable.app/
1•tokenomics•1h ago•0 comments