frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Experimental Optical Encoder for Qwen3-VLM-2B-Instruct

https://github.com/Volkopat/VLM-Optical-Encoder
1•volkopat2•3h ago

Comments

volkopat2•3h ago
Hey everyone!

So I am quite amazed with the innovation in DeepSeek-OCR model! I wanted to break it apart and try it out myself, so I asked myself - what if I extract the encoder to fit other existing VLMs?

https://huggingface.co/Volkopat/DeepSeek-DeepEncoder

I didn't have any expectations and was doing this just for fun cos why not? Moving on, after vibe scripting with the encoder, I tried to patch this with Qwen3-VLM 2B. Due to difference in input dimensions of Qwen and the DeepSeek encoder, I pretrained a custom adapter to fit this piece of puzzle.

https://huggingface.co/Volkopat/Qwen-VLM-Optical-Encoder

Long story short - I noticed some performance gains in my experimental synthetic dataset as well as Longbench V2. You can check the project out and try it -

https://github.com/Volkopat/VLM-Optical-Encoder

I have added the training and test scripts in the repo.

In a miniscule test run of 50 cases of LongBench V2 benchmark - I noticed that the custom optical encoder with compressed visual tokens performed slightly better than the original Qwen encoder. It could be that 2B model is really weak for this benchmark.

I could be wrong in my approach so I don't want to hype this too much, and I am more curious to find out if this is scalable beyond 2B? I'm GPU poor with a 12 GB 5070 so I would love it if someone gives this a shot and try to take it further? Hope this helps!

A way to write Canonical LR parsers by hand [video]

https://www.youtube.com/watch?v=d-qyPFO5l1U
1•scorbiclife•2m ago•1 comments

'Chinese lantern' structure shifts into many shapes for various applications

https://techxplore.com/news/2025-10-chinese-lantern-shifts-dozen-applications.html
2•PaulHoule•4m ago•0 comments

Reinventing iOS Automation: Editorial Review

https://www.macstories.net/stories/editorial-for-ipad-review/
1•ijidak•6m ago•0 comments

Gluten sensitivity linked to gut–brain interaction, not gluten itself

https://medicalxpress.com/news/2025-10-gluten-sensitivity-linked-gutbrain-interaction.html
3•bikenaga•8m ago•0 comments

The Road to Flux 1.0

https://github.com/tcbrindle/flux/discussions/242
1•coffeeaddict1•8m ago•0 comments

React Flow, open source libraries for node-based UIs with React or Svelte

https://github.com/xyflow/xyflow
4•mountainview•14m ago•0 comments

Agent Engineering 101: Software, systems, and security in practice

https://www.ashpreetbedi.com/articles/agent-engineering
4•bediashpreet•17m ago•0 comments

Sora 2 Can Generate Videos of Celebs Appearing to Shout Racist Slurs

https://www.rollingstone.com/culture/culture-features/openai-sora-2-celebrities-racial-slurs-1235...
1•healsdata•20m ago•1 comments

The maps of Ursula K Le Guin reveal an insight into world-building

https://theconversation.com/the-maps-of-ursula-k-le-guin-reveal-a-fascinating-insight-into-world-...
2•sohkamyung•21m ago•0 comments

Google claims 'quantum advantage' again – but researchers are sceptical

https://www.nature.com/articles/d41586-025-03300-4
1•gnabgib•22m ago•0 comments

PlainErrors: Streamlined Rails Error Pages for LLM Agents

https://www.panozzaj.com/blog/2025/10/23/plainerrors-streamlined-rails-error-pages-for-llm-agents/
1•panozzaj•23m ago•1 comments

A New Browser from Perplexity

https://www.perplexity.ai/comet
1•alexpogosyan•33m ago•0 comments

Data Science Weekly – Issue 622

https://datascienceweekly.substack.com/p/data-science-weekly-issue-622
1•sebg•37m ago•0 comments

Building a stable 'abode of thought': Kant's rules for virtuous thinking

https://theconversation.com/building-a-stable-abode-of-thought-kants-rules-for-virtuous-thinking-...
2•bikenaga•40m ago•1 comments

Late-surviving New Mexican dinosaurs illuminate high diversity and provinciality

https://www.science.org/doi/10.1126/science.adw3282
1•Stratoscope•41m ago•1 comments

My Car Is Becoming a Brick (EVs are poised to age like smartphones)

https://www.theatlantic.com/technology/2025/10/electric-car-software-updates-tesla/684643/
2•ryan_j_naughton•44m ago•7 comments

Ask HN: Mamdani is poised to become new mayor of New York. How do locals feel?

2•frenchmajesty•47m ago•1 comments

A few patterns with Truchet tiles

https://carlosn.com.br/blog/post/a-few-patterns-with-truchet-tiles/
1•carlosneves•47m ago•0 comments

Tamuning debate exposes rifts over who should vote on political future of island

https://www.postguam.com/news/local/tamuning-debate-exposes-rifts-over-who-should-vote-on-politic...
1•sipofwater•48m ago•2 comments

Dwarkesh Patel's Podcast with Andrej Karpathy

https://thezvi.substack.com/p/on-dwarkesh-patels-podcast-with-andrej
1•paulpauper•50m ago•0 comments

Lod Streaming for Gaussian Splats by Playcanvas - Demo of 34M splats

https://twitter.com/playcanvas/status/1981341894287274192
1•smusamashah•52m ago•1 comments

Woman wins right to work from home every day in landmark case

https://www.independent.co.uk/news/world/australasia/westpac-work-from-home-jobs-australia-b28490...
2•dmitrygr•53m ago•0 comments

Introduction to the concept of likelihood and its applications (2018)

https://journals.sagepub.com/doi/10.1177/2515245917744314
3•sebg•55m ago•0 comments

Google's New Quantum Algorithm May Be Useful

https://spectrum.ieee.org/quantum-echoes
1•te•55m ago•0 comments

Non Equilibrium Statistical Mechanics and Biology

https://chillphysicsenjoyer.substack.com/p/non-equilibrium-statistical-mechanics
1•crescit_eundo•55m ago•1 comments

At least 25 states plan to cut off food aid benefits in November

https://www.politico.com/news/2025/10/23/states-snap-food-aid-benefits-government-shutdown-00619117
5•cypherpunks01•55m ago•0 comments

Scog: Easily generate shell completions for any binary (bash, zsh, fish)

https://github.com/vrmiguel/scog
2•vrmiguel•56m ago•1 comments

Ask HN: Does anyone use D(lang) regularly? And for what?

1•nateb2022•57m ago•0 comments

Java Performs Better When You Misspell Variable Names

https://medium.com/javarevisited/java-performs-better-when-you-misspell-variable-names-5b9709893121
2•crummy•58m ago•1 comments

The Goldfish Problem: Why AI Agents Need Smaller Bowls to Swim Effectively

https://blog.justcopy.ai/p/why-ai-agents-struggle-with-big-goals
8•anup_sia•58m ago•1 comments