The BERT models are easy to probability-calibrate too!
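For concreteness, one standard way to do that calibration is temperature scaling: fit a single scalar T on held-out logits so that softmax(logits / T) matches empirical accuracy. A minimal sketch, assuming you already have validation logits and labels from the fine-tuned model (all names here are placeholders):

    import torch
    import torch.nn.functional as F

    def fit_temperature(val_logits, val_labels):
        # Optimize log(T) so T stays positive; minimize NLL on held-out data.
        log_t = torch.zeros(1, requires_grad=True)
        opt = torch.optim.LBFGS([log_t], lr=0.1, max_iter=50)

        def closure():
            opt.zero_grad()
            loss = F.cross_entropy(val_logits / log_t.exp(), val_labels)
            loss.backward()
            return loss

        opt.step(closure)
        return log_t.exp().item()

    # At test time, divide logits by T before softmax:
    # T = fit_temperature(val_logits, val_labels)
    # probs = F.softmax(test_logits / T, dim=-1)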
BERT + pooling + SVM works pretty well for some problems and is maybe 20x faster to train than a fine-tuned BERT.
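That recipe is short enough to sketch end to end with off-the-shelf parts (sentence-transformers for mean-pooled embeddings, scikit-learn for the SVM; the model name and toy data are illustrative defaults, not what I actually shipped):

    from sentence_transformers import SentenceTransformer
    from sklearn.svm import LinearSVC
    from sklearn.calibration import CalibratedClassifierCV

    # Frozen BERT-family encoder with mean pooling baked in.
    encoder = SentenceTransformer("all-MiniLM-L6-v2")

    train_texts = ["great product", "would buy again",
                   "terrible support", "broke in a day"]   # toy data
    train_labels = [1, 1, 0, 0]

    X = encoder.encode(train_texts)   # (n_samples, embedding_dim)

    # LinearSVC has no predict_proba; wrapping it in CalibratedClassifierCV
    # both fixes that and calibrates, tying into the point above.
    clf = CalibratedClassifierCV(LinearSVC(), cv=2)
    clf.fit(X, train_labels)

    probs = clf.predict_proba(encoder.encode(["arrived broken"]))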
My take, as an academic-adjacent [1] developer of boring and reliable applications, is that I don't like the training recipes people use for fine-tuned BERT [2], and I think BERT + biLSTM + probability calibration should equal or exceed those fine-tuned BERTs, particularly because I can add early stopping and do model selection with a parameter scan (sketched in code after the footnotes).
[1] reads arXiv papers where run-of-the-mill researchers solve run-of-the-mill problems
[2] particularly as the number of samples is >> 500, which is easy to get in many cases; e.g., for most tasks you can make 1-2k judgements a day, though with visual tasks, when I've done 5k-a-day sprints for a few days, I start to hallucinate and compulsively classify the scenes in front of me
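The sketch promised above: frozen BERT token states feed a small bidirectional LSTM head, trained with early stopping on validation loss, with the head's hyperparameters scanned in an outer loop. Architecture and numbers are illustrative, not a tuned recipe, and the dataloaders are assumed to yield (token_states, labels) batches from a frozen BERT:

    import torch
    import torch.nn as nn

    class BiLSTMHead(nn.Module):
        def __init__(self, bert_dim=768, hidden=256, n_classes=2):
            super().__init__()
            self.lstm = nn.LSTM(bert_dim, hidden, batch_first=True,
                                bidirectional=True)
            self.out = nn.Linear(2 * hidden, n_classes)

        def forward(self, token_states):          # (batch, seq_len, bert_dim)
            states, _ = self.lstm(token_states)
            return self.out(states.mean(dim=1))   # mean-pool over tokens

    def train_with_early_stopping(model, train_dl, val_dl,
                                  patience=3, max_epochs=50):
        opt = torch.optim.AdamW(model.parameters(), lr=1e-3)
        loss_fn = nn.CrossEntropyLoss()
        best_loss, best_state, bad_epochs = float("inf"), None, 0
        for epoch in range(max_epochs):
            model.train()
            for x, y in train_dl:
                opt.zero_grad()
                loss_fn(model(x), y).backward()
                opt.step()
            model.eval()
            with torch.no_grad():
                val_loss = sum(loss_fn(model(x), y).item() for x, y in val_dl)
            if val_loss < best_loss:
                best_loss, bad_epochs = val_loss, 0
                best_state = {k: v.clone() for k, v in model.state_dict().items()}
            else:
                bad_epochs += 1
                if bad_epochs >= patience:
                    break
        model.load_state_dict(best_state)
        return model

    # Model selection: scan a small grid of head sizes, keep the best on
    # validation loss, then temperature-scale it (see the snippet up top).
    # for hidden in (128, 256, 512):
    #     model = train_with_early_stopping(BiLSTMHead(hidden=hidden),
    #                                       train_dl, val_dl)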