news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Desktop app for generating LLM fine-tuning datasets

https://github.com/AronDaron/dataset-generator

2•AronDaron•1h ago

Comments

AronDaron•1h ago

Hey,

I've been building side projects with Claude Code for a few months, but I'm completely new to fine-tuning — started experimenting maybe a week ago. From day one I wanted a GUI for the dataset side of the workflow, so this desktop app grew alongside my very first FT attempts.

I know there are similar apps out there, but I wanted something simple that non-technical users could run with open-source models end-to-end.

To sanity-check whether the datasets were actually useful I fine-tuned Qwen2.5-Coder-7B-Instruct on them and ran HumanEval / HumanEval+ (pass@1, 5 runs). Picked these benchmarks because they match the dataset's focus and run fast on my machine:

- Base: 55.5% / 49.0% - FT V2 (1135 samples from the app): 60.0% / 54.0%

Error bars don't overlap so it's at least not noise. Obviously HumanEval is only one slice — YMMV with other categories / criteria.

Stack: Next.js 16 + FastAPI + SQLite, packaged as standalone binary (Win/Linux).

Code: https://github.com/AronDaron/dataset-generator Fine-tuned model: https://huggingface.co/AronDaron/Qwen2.5-Coder-7B-Instruct-D... Datasets: https://huggingface.co/datasets/AronDaron/dataset-gen-v1 / https://huggingface.co/datasets/AronDaron/dataset-gen-v2

Happy to hear feedback, especially if something doesn't work on your setup or if the approach misses something obvious — this is my first public tool release.

Draft-Meow-Mrrp-00

https://datatracker.ietf.org/doc/html/draft-meow-mrrp-00

1•lstodd•1m ago•0 comments

Slayerfest: An AI Simulation of Academia in the Buffyverse

https://victoriaritvo.com/blog/slayerfest/

1•evakhoury•1m ago•0 comments

Why Some S3 Videocards Have a Brightness Issue

https://hackaday.com/2026/04/21/why-some-s3-videocards-have-a-brightness-issue/

1•omer_k•1m ago•0 comments

Haiku 4.5 + skills outperforms Opus 4.7. 9 models tested with and without skills

https://tessl.io/blog/anthropic-openai-or-cursor-model-for-your-agent-skills-7-learnings-from-run...

2•sjmaplesec•2m ago•2 comments

Show HN: TogetherLetters – Group newsletters with no app, no feed, no login

https://www.togetherletters.com

2•sanjayparekh•2m ago•0 comments

A 100-Year-Old Cartoon Trope Solved a Modern Emoji Problem

https://substack.com/sign-in

2•lacieargyle•4m ago•0 comments

Red Lobster Revives Endless Shrimp Two Years After Deal Led to $11M Loss

https://balleralert.com/red-lobster-shrimp-return/

2•randycupertino•6m ago•1 comments

Midjourney and Suno v4 and Veo 3.1 chained in one Dify workflow for $0.35 per ad

https://twitter.com/aikitpros/status/2046596943023890780

2•yujunjie•6m ago•0 comments

Show HN: DialtoneApp Network, card payments for bot commerce

2•fcpguru•7m ago•0 comments

Compromised AI Tool Triggered the Vercel Security Breach

https://entelligence.ai/blogs/how-an-ai-tool-triggered-the-vercel-security-breach

2•astro_09•8m ago•0 comments

Where Are All These Meteors Coming From?

https://www.nytimes.com/2026/04/21/science/march-fireballs-meteors-astronomy.html

3•digital55•8m ago•0 comments

YouTuber Copyright Struck After Others Layer AI Voiceovers on Video Game Music

https://www.techdirt.com/2026/04/20/youtuber-copyright-struck-after-others-layer-ai-voiceovers-on...

2•hn_acker•8m ago•0 comments

Faster LLM Inference via Sequential Monte Carlo

https://arxiv.org/abs/2604.15672

2•matt_d•9m ago•0 comments

Show HN: LLMSecure – prompt injection detection, no signup

https://llmsecure.io/

2•eliadmualem•9m ago•1 comments

AI is changing how Texas universities teach computer science as job market slows

https://www.texastribune.org/2026/04/21/texas-computer-science-college-degree-ai/

3•hn_acker•10m ago•0 comments

Building a Fast Multilingual OCR Model with Synthetic Data

https://huggingface.co/blog/nvidia/nemotron-ocr-v2

2•gmays•10m ago•0 comments

Show HN: Handler – Open-source local sandboxes and control plane for code agents

https://handler.dev

2•shake-n-fries•11m ago•0 comments

Show HN: Four years of my CS degree, typeset in LaTeX (850 pages)

https://starikov.co/academia-notes/

2•iusevim•12m ago•0 comments

OpenAI turns on cost-per-click ads inside ChatGPT

https://digiday.com/marketing/openai-turns-on-cost-per-click-ads-inside-chatgpt/

4•thm•12m ago•0 comments

200MP iPhone camera rumors align on 2028 release

https://9to5mac.com/2026/04/21/200mp-iphone-camera-rumors-align-on-2028-release/

2•omer_k•12m ago•0 comments

Texas House Speaker orders probe of Roblox in response to Uvalde shooting game

https://www.texastribune.org/2026/04/20/texas-speaker-dustin-burrows-roblox-legislature-child-gam...

3•hn_acker•13m ago•1 comments

Self-Sovereign Agent

https://arxiv.org/abs/2604.08551

2•AgentNews•13m ago•0 comments

Show HN: Verified Deep Learning with Lean 4

https://brettkoonce.github.io/lean4-mlir/blueprint/

2•asparagui•13m ago•0 comments

Command Execution via Drag-and-Drop in Terminal Emulators

https://sdushantha.github.io/post/drop-it-like-its-hot

2•speckx•13m ago•0 comments

Show HN: App Promo Video with Claude Design and Claude Code

https://www.youtube.com/watch?v=1IIawdmgxTU

2•kamilms21•14m ago•0 comments

Claude Code + Jupyter Notebooks Finally Work Well

https://www.reviewnb.com/claude-code-with-jupyter-notebooks

2•amirathi•16m ago•0 comments

Techno Kick Synthesizer that runs in the browser

https://technokick.com

2•stagas•16m ago•0 comments

Morpheus Research: Figure Techn Is a Lender Masquerading as a Blockchain Darling

https://www.morpheus-research.com/figure/

2•Brajeshwar•17m ago•0 comments

Engineering team looks healthy. It probably isn't

https://dbarabashh.com/thoughts-and-experience/your-engineering-team-looks-healthy

3•birdculture•19m ago•0 comments

Type hints – a mediocre programmer's reaction (2015)

https://mail.python.org/pipermail/python-dev/2015-April/139267.html

2•downbad_•19m ago•1 comments