frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

TensorSharp: Open Source Local LLM Inference Engine

https://github.com/zhongkaifu/TensorSharp
1•zhongkaifu•1h ago

Comments

zhongkaifu•1h ago
I would like to share my latest open source local Unsloth (GGUF) LLM inference engine and applications. It supports many models from Unsloth, like Gemma4, DiffusionGemma, Qwen3.6 with multi-modal (image, vision, audio), reasoning and function tool. It can run on Windows/MacOS/Linux and fully leverage GPU's capability. The API is completely compatible with OpenAI and Ollama interface. It has on par performance than llama.cpp

This project is not just a C# wrapper of llama.cpp. It implemented the entire LLM inference engine from bottom to top. If you use CPU backend, it's 100% pure C# code execution. Besides CPU backend, I also implmented CUDA, MLX and GGML backend. The GGML backend refer GGML project as external project, and I build a few fusion operation at higher level.

I learned a lot from other projects and apply them for TensorSharp, such as paged KV cache and continuous batching from vLLM, SSD based cache for MoE model from oMLX, GGUF quanztized from llama.cpp and other optimizations for prefill and decode.

Any feedback and comments are welcome. If you like it, it would be really appreciated if you can get this project a star in GitHub. Thanks in advance.

Snapcompact: SoTA Compaction – Instant, Local, Free. Pick 3

https://blog.can.ac/2026/06/10/snapcompact/
1•himata4113•43s ago•0 comments

Identifying Life-Changing Books with LLMs

https://blog.joellehman.com/identifying-life-changing-books-with-llms.html
1•andsoitis•2m ago•0 comments

Formal Methods and the Future of Programming

https://blog.janestreet.com/formal-methods-at-jane-street-index/
2•sebg•5m ago•0 comments

Shareholder Supremacy and the Precog CEO

https://pluralistic.net/2026/06/13/minority-shareholder-report/
1•hn_acker•5m ago•0 comments

Ask HN: What would you do with a trillion dollars

1•brudgers•5m ago•0 comments

The Future of Crossover

https://www.codeweavers.com/blog/mjohnson/2026/06/11/whats-in-and-whats-out-for-crossover-27
2•akyuu•5m ago•0 comments

Thanks Amazon

https://www.reddit.com/r/ClaudeAI/s/TUivYGKnCK
2•ihazgithub•5m ago•0 comments

Ask HN: If you had a trillion dollars, what would you do?

1•brudgers•6m ago•0 comments

Show HN: Domainbase – Instant domain search and management

https://domainbase.app/
1•alexpate•6m ago•0 comments

How cyber-criminals adopted Russias secret language of thieves

https://www.bbc.com/future/article/20260611-fenya-how-cyber-criminals-adopted-russias-secret-lang...
1•1659447091•6m ago•0 comments

Mandrake –> Gentoo a.k.a. "Mandrake Expatriate Syndrome" (2003)

https://www.greenfly.org/mes.html
1•coatmatter•7m ago•0 comments

Sealed Super Mario Bros Sells for $3M Setting New Record for a Video Game

https://www.ha.com/heritage-auctions-press-releases-and-news/highest-graded-super-mario-bros.-sel...
2•HelloUsername•7m ago•0 comments

Show HN: Sessemi – Scraping API That Solves Cloudflare/DataDome/Akamai Itself

https://sessemi.com
1•sessemi•8m ago•0 comments

Kennedy Center Says It Has Removed Trump's Name from Building

https://www.wsj.com/politics/policy/kennedy-center-misses-deadline-wants-more-time-to-remove-trum...
1•JumpCrisscross•9m ago•0 comments

Why India wants German submarines

https://www.dw.com/en/why-india-wants-german-submarines-and-what-pakistan-and-china-have-to-do-wi...
1•rustoo•11m ago•0 comments

The first trillionaire is a killer

https://www.theverge.com/tech/949259/the-worlds-first-trillionaire-is-a-killer
3•okneil•11m ago•0 comments

Seasonal changes in human hair growth

https://pubmed.ncbi.nlm.nih.gov/2003996/
1•JumpCrisscross•11m ago•0 comments

Why greatness cannot be planned

https://yinuoli.org/ken-stanley-and-joel-lehman-why-greatness-cant-be-planned/
1•andsoitis•11m ago•0 comments

What Happens to an Economy When It's Too Hot to Work?

https://www.bloomberg.com/news/features/2026-06-12/india-s-extreme-heat-is-hurting-its-economy-an...
3•littlexsparkee•15m ago•0 comments

Running DOS on Behringers DDX3216 with a DIY x86-Bios from Scratch

https://chrisdevblog.com/2026/06/08/running-dos-on-behringers-ddx3216-using-a-diy-x86-bios/
2•rasz•18m ago•0 comments

Something is jamming GPS over Europe. Here's what we found

https://www.youtube.com/watch?v=tz23G_UXCGA
2•nradov•19m ago•0 comments

Show HN: Deterministic and offline duplicate-code detector

https://github.com/Rafaelpta/dupehound
3•rafaepta•20m ago•0 comments

TinyWind

https://tinywind.io
3•kqr•22m ago•0 comments

Memory-mapped files considered harmful (for databases) (2022)

https://quasar.ai/2022/01/24/memory-mapped-files-considered-harmful/
2•tosh•24m ago•0 comments

Rows Are Made for Sorting and That's Just What We'll Do (2023) [pdf]

https://duckdb.org/pdf/ICDE2023-kuiper-muehleisen-sorting.pdf
2•tosh•24m ago•0 comments

Google Gemini-SQL2 tops text-to-SQL benchmarks

https://the-decoder.com/google-researchs-gemini-sql2-tops-text-to-sql-benchmarks-by-a-wide-margin/
2•geox•27m ago•0 comments

AI forgoes toxic positivity for neurodivergents

https://medium.com/@mantaman555/the-daily-exhaustion-of-waiting-mode-why-standard-productivity-sy...
3•FDX2018•27m ago•0 comments

Show HN: Seer – Private Ollama Chat in the Browser, No Account Needed

https://manticthink.com/
2•Colewilliamz•28m ago•0 comments

Crime theory: Rehabilitation, or harsh punishment

https://agoralogica.com/debates/cee5881c-a333-4e79-b81d-17552904a568
2•Phaedruss•29m ago•0 comments

There is not 'sentient plasma', refuting the claims of David Grusch

2•dabadabad00•32m ago•0 comments