frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I'm 11 and trained a custom MoE LLM for $1

3•Hey1-Arthur•1h ago
# I'm 11 years old and I trained my own LLM from scratch. 50 people downloaded it in 24 hours.

Hey r/LocalLLaMA,

I'm Arthur, I'm 11 years old, and I just released *Wind Arc 1.6* — a custom architecture LLM I built and trained myself.

## What it is

Wind Arc 1.6 is a 3.6B parameter model with a custom architecture I designed:

- *Mixture of Experts FFN* — 4 routed experts + 1 shared expert per layer (replaces standard MLP) - *YaRN RoPE* — extends context from 8k → 32k tokens - *Hybrid Attention* — full attention every 4th layer, sliding window otherwise - *QK-Norm* for training stability

Base: Qwen3-1.7B with the FFN layers completely replaced by my custom MoE. (Fully custom)

## How I trained it

- Hardware: 1× RTX 5090 rented from Nova Cloud - Cost: literally $1 - Time: 55 minutes - Final loss: 2.66

Data mix: smoltalk + python-codes-25k + FineWeb-Edu + custom identity and Christian Q&A data I wrote myself.

## What it's good at

- Python and general coding with explanations - Christian questions (Bible, theology, Christian living) - General chat and learning

## The honest truth

It's not GPT-4. Loss of 2.66 on a 1.7B base with 55 minutes of training isn't going to beat frontier models. But it runs locally, it's open source, and it's mine.

I still need to do SFT (the identity responses aren't perfect yet) and GGUF conversion is blocked by the custom MoE architecture. Working on both.

## Why I built it

I'm building *North.ai* — an AI startup focused on powerful models that run on small hardware. Wind Arc is our flagship model. Our platform Neurotype will let anyone train, deploy, and use AI without needing expensive cloud budgets.

I've trained 10+ models. This is the first one I'm actually proud enough to release.

## Links

HuggingFace: https://huggingface.co/arthu1/wind-arc-1-6

Would love feedback from people who actually know what they're doing. Be honest — I can take it.

— Arthur, age 11

EPA Rejects Colorado's Regional Haze Plan over Disputed Coal Plant Closure

https://eelp.law.harvard.edu/epa-rejects-colorados-regional-haze-plan-over-disputed-coal-plant-cl...
1•geox•2m ago•0 comments

Stop Teaching R. Teach Python.

https://andrewpwheeler.com/2026/03/22/stop-teaching-r-teach-python/
1•apwheele•6m ago•0 comments

Agent Traffic Control

https://www.agenttrafficcontrol.com/
1•handfuloflight•8m ago•0 comments

The Abstraction Layer

https://www.swiftjectivec.com/the-abstraction-layer/
1•ingve•10m ago•0 comments

Terafab: the next step towards becoming a galactic civilization

https://twitter.com/spacex/status/2035519125284380672
1•simonebrunozzi•11m ago•0 comments

Show HN: Foundations of Music (FoM)

https://bookerapp.replit.app/book/fom
1•ersinesen•11m ago•0 comments

Intelligence, Agency, and the Human Will of AI

https://larrymuhlstein.substack.com/p/intelligence-agency-and-the-human
1•lmuhlstein•14m ago•0 comments

Show HN: Burrow, a Gopher browser/proxy written in JavaScript

https://burrow.din.gy/
1•treve•14m ago•0 comments

TrustClaw

https://www.trustclaw.app/
1•wallflower•18m ago•0 comments

RSS Creator on Bluesky and at Proto

https://zeldman.com/2026/03/22/rss-creator-on-bluesky-at-proto/
1•8organicbits•19m ago•0 comments

Interviewing tactics for a post-LLM world

https://blog.incrementalforgetting.tech/p/interviewing-tactics-for-a-post-llm
1•BerislavLopac•20m ago•0 comments

I built a free interactive platform to learn KDB/q

https://kdb-academy.web.app/
2•kdv-cave•22m ago•1 comments

Auto Translate JSON Library – JSON and Google to Multi-Format and 8 Providers

https://github.com/topce/auto-translate-json-library
1•topce•22m ago•0 comments

Watchtower

https://github.com/fahd09/watchtower
1•handfuloflight•25m ago•0 comments

Personal Computing (2022)

https://josh8.com/blog/personal_computing.html
3•xk3•26m ago•0 comments

I built a photonic AI chip for space with 860x less power, rad-hard to 106 krad

https://github.com/venticedlatte/pushinka-engine
2•ventiproject•30m ago•1 comments

Sorry They Are Just Not That into You

https://marcrandolph.com/sorry-they-are-just-not-that-into-you-2/
1•theorchid•30m ago•0 comments

Amazon Plans Smartphone Comeback

https://www.reuters.com/technology/amazon-plans-smartphone-comeback-more-than-decade-after-fire-p...
3•tosh•32m ago•0 comments

Google Drive Killer Just Got Better – Nextcloud Hub 26 and Upgrade Guide

https://www.youtube.com/watch?v=6wMeL9xlzag
1•doener•33m ago•0 comments

How to customize Firefox AI sidebar

https://gist.github.com/breisa/bd12cfa9f2c9cea8318607361a796310
1•breisa•34m ago•1 comments

Show HN: ClaudeRank – Open-source token and concurrency widget for Claude Code

https://www.clauderank.com/
2•ymaws•34m ago•0 comments

Program neural networks by shaping energy landscapes

https://hlm.qriton.com/
1•ddmma•36m ago•0 comments

A podcast about about how Big Tech is ruining the industry

https://limeleaf.coop/podcast/
1•mooreds•38m ago•0 comments

Why Spotify AI more than music will be the secret to keeping subscribers

https://www.cnbc.com/2026/03/22/spotify-apple-amazon-streaming-music-ai.html
1•randycupertino•42m ago•0 comments

buddy

https://github.com/tavmem/buddy
2•tosh•44m ago•0 comments

I Migrated My Blog from WordPress to Astro

https://timleland.com/how-i-migrated-this-blog-from-wordpress-to-astro/
1•TimLeland•46m ago•0 comments

AI videos of sexualised black women removed from TikTok after BBC investigation

https://www.bbc.com/news/articles/c070e283k8vo
1•randycupertino•49m ago•0 comments

Caddy Docker image in Spain is inaccessible

http://docker-images-prod.6aa30f8b08e16409b46e0173d6de2f56.r2.cloudflarestorage.com/
1•evilmonkey19•51m ago•1 comments

Superpower, a Peptide Startup Making People Hotter, Smarter

https://www.businessinsider.com/inside-superpower-peptide-startup-people-hotter-smarter-2026-3
1•msolujic•51m ago•0 comments

Apply Less, but Better

https://www.adriankrebs.ch/blog/apply-less-but-better/
2•hubraumhugo•53m ago•1 comments