frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Dialogue Between a Developer and a Kid

https://riggraz.dev/dialogue-developer.html
1•Growtika•5m ago•0 comments

Show HN: LTXMac a native Mac app to do text to video generation

https://james-see.github.io/ltx-video-mac/
1•jamescampbell•6m ago•0 comments

Show HN: Ever wanted to look at yourself in Braille?

https://github.com/NishantJoshi00/dith
2•cat-whisperer•8m ago•0 comments

Show HN: A Wall Street Terminal for Everyone

https://marketterminal.com/chart
1•adamfontan•12m ago•0 comments

How to Choose CD/DVD Archival Media (2013)

https://adterrasperaspera.com/blog/2006/10/30/how-to-choose-cddvd-archival-media/
1•walterbell•12m ago•0 comments

What Happened to WebAssembly

https://emnudge.dev/blog/what-happened-to-webassembly/
3•enz•13m ago•0 comments

There's a ridiculous amount of tech in a disposable vape

https://blog.jgc.org/2026/01/theres-ridiculous-amount-of-tech-in.html
1•rcarmo•14m ago•0 comments

Elon Musk's X must be banned

https://disconnect.blog/elon-musks-x-must-be-banned/
2•mnewme•14m ago•0 comments

Rethinking Information for Computationally Bounded Intelligence

https://arxiv.org/abs/2601.03220
1•tzury•15m ago•1 comments

As bombs fell, we committed an act of rebellion: we planted a garden in Gaza

https://www.theguardian.com/commentisfree/2026/jan/08/gaza-israel-palestine-garden-seed-food
5•ciconia•16m ago•0 comments

Iranian Censorship, Bypasses, Browser Extensions, and Proxies

https://joshua.hu/iranian-browser-extension-addon-censorship-bypasses
1•mmsc•22m ago•0 comments

Jxl-Rs Merged into Chromium

https://github.com/chromium/chromium/commit/3badff27281339878293e935a5e0fbb41da553bf
3•todsacerdoti•22m ago•0 comments

Join Us in Building LoongFlow – Cognitive Evolutionary AI Framework

https://github.com/baidu-baige/LoongFlow
1•FreshmanD•25m ago•1 comments

Stop Overthinking Struct Pointer and Value Semantics in Go

https://preslav.me/2026/01/08/golang-structs-vs-pointers-pointer-first/
1•ingve•26m ago•0 comments

Google Is Adding an 'AI Inbox' to Gmail That Summarizes Emails

https://www.wired.com/story/google-ai-inbox-gmail/
2•signa11•27m ago•0 comments

Episode 29 of the Dirk and Linus show

https://lwn.net/Articles/1050317/
2•signa11•29m ago•0 comments

Terence Tao's list of AI contributions to Erdős problems

https://github.com/teorth/erdosproblems/wiki/AI-contributions-to-Erd%C5%91s-problems
1•nomilk•29m ago•0 comments

Treating UI Regions as Independent Actors Makes Terminal State Manageable

https://www.rodriguez.today/articles/reactive-tui-architecture-with-actors
2•signa11•31m ago•0 comments

The Frontier Is Now Free

https://ampcode.com/news/amp-free-frontier
1•tosh•31m ago•0 comments

A Major Mail Provider Demonstrate They Likely Do Not Understand Mail at All

https://nxdomain.no/~peter/they_do_not_understand_mail_at_all.html
2•gpi•33m ago•0 comments

New Article: How to File a Patent Application Yourself

https://idea2patentai.com/articles/diy-provisional-patent-filing
1•idea2patentAI•36m ago•1 comments

CES 2026: We tried an AI supercomputer that fit in our pocket. Meet Tiiny AI

https://mashable.com/article/ces-2026-tiiny-ai-pocket-lab-ai-supercomputer
1•_____k•37m ago•0 comments

Claude-quill your inline parallel coderabbit

https://github.com/blas0/claude-quill
1•blas0•39m ago•1 comments

European Commission issues call for evidence on open source

https://lwn.net/Articles/1053107/
3•pabs3•42m ago•0 comments

Mathematics for Computer Science (2018) [pdf]

https://courses.csail.mit.edu/6.042/spring18/mcs.pdf
20•vismit2000•45m ago•0 comments

Show HN: I built an AI tool to fight NYC's new "Acoustic Camera" tickets ($800)

https://nycnoisecameraticket.com
2•todaycompanies•45m ago•1 comments

Preview and edit marketing images before production

https://vect.pro/#/signup?continue=%2Fapp%2Ftools%3Ftool%3DAI+Image+Studio
2•MMAFRAZ•46m ago•1 comments

Rise of AI chatbots for shopping boosts analyst hopes for Shopify's growth

https://www.theglobeandmail.com/business/article-shopify-ai-chatbots-online-shopping-growth-plans/
1•petethomas•49m ago•0 comments

How to Protest Safely in the Age of Surveillance

https://www.wired.com/story/how-to-protest-safely-surveillance-digital-privacy/
5•saikatsg•50m ago•0 comments

Show HN: Workzonespeedingticket.com – Automating disputes for automated fines

https://workzonespeedingticket.com/
2•todaycompanies•54m ago•1 comments
Open in hackernews

LLM-feat: Python library for automated feature engineering with Pandas

https://pypi.org/project/llm-feat/
1•srinivaskumarr•20h ago

Comments

srinivaskumarr•20h ago
*What My Project Does:*

llm-feat is a Python library that uses OpenAI LLMs (like GPT-4) to automatically generate feature engineering code for pandas DataFrames. You provide your DataFrame and metadata describing what each column means, and the LLM generates context-aware feature engineering code that actually makes sense for your domain.

The library works directly in Jupyter notebooks - when you call the function, the generated code automatically appears in the next cell. You can also get detailed reports explaining the rationale behind each feature, which helps you understand what the LLM is thinking and why certain features were created.

Under the hood, it uses GPT-4's understanding of domain context to generate features that are specific to your problem. For example, when tested on a medical dataset, it generated clinically relevant features like lipid ratios (LDL/HDL) and BMI interactions that a generic rule-based library wouldn't know to create.

*Target Audience:*

This library is designed for:

- Data Scientists and ML Engineers building predictive models who want to speed up the feature engineering process without sacrificing domain relevance.

- ML Practitioners working on real projects who need production-ready tools (I've been using it in my own work), especially useful during the exploratory phase when you're trying to figure out what features might work.

- Anyone tired of manually engineering features and wants an intelligent assistant that understands context rather than just applying generic transformations.

*Comparison:*

vs. Rule-based libraries (featuretools, tsfresh): These libraries use predefined transformation rules that work across all domains but don't understand context. llm-feat uses LLMs to understand your specific domain and generate features that are relevant to your problem. For example, on a medical dataset, it generated lipid ratios and composite risk scores that a generic library wouldn't create.

vs. AutoML tools (AutoGluon, H2O AutoML): AutoML tools are black boxes that handle the entire ML pipeline. llm-feat gives you the actual code to review, modify, and understand. You maintain full control over your feature engineering process while getting intelligent suggestions.

vs. Manual feature engineering: Obviously much faster - what would take hours of domain research and coding happens in seconds. Plus, the LLM often suggests features you might not have thought of.

*Results:*

Tested on the Diabetes dataset: - Baseline: RMSE 54.33 with 10 original features - With LLM features: RMSE 53.53 with 20 features (10 original + 10 generated) - Improvement: 1.47% RMSE reduction, R² improved from 0.44 to 0.46

The generated features included lipid ratios, BMI interactions, and composite risk scores that were clinically relevant and improved model performance.

*Links & Source:*

GitHub: https://github.com/codeastra2/llm-feat

PyPI: pip install llm-feat

I would love feedback on the API design or suggestions for improvements!