Hi HN! I built Shimmy, a lightweight AI inference server that can now load HuggingFace SafeTensors models directly without any Python dependencies.
The core problem: I wanted to run HuggingFace models locally but didn't want the heavyweight Python ML stack. Most solutions require Python plus PyTorch plus the transformers library, which can mean 2GB+ of dependencies alone.
What's new in v1.2.0:
• Native SafeTensors support - loads .safetensors files directly in Rust
• 2x faster model loading compared to traditional formats
• Zero Python dependencies - pure Rust implementation
• Still just a 5MB binary (vs 50MB+ alternatives like Ollama)
• Full OpenAI API compatibility for drop-in replacement
Technical details:
- Built with native SafeTensors parsing, not Python bindings (see the sketch below this list)
- Memory-efficient tensor loading with bounds checking
- Tested with 100MB+ models loading in under a second
- Cross-platform: Windows, macOS (Intel/ARM), Linux
- Supports mixed model formats (GGUF + SafeTensors)
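For anyone curious what native SafeTensors parsing looks like in Rust, here's a minimal sketch using HuggingFace's `safetensors` crate. It illustrates the format-level idea (a JSON header describing dtypes and shapes, followed by raw tensor bytes), not necessarily the exact code path Shimmy uses internally:

    use safetensors::SafeTensors;
    use std::fs;

    fn main() -> Result<(), Box<dyn std::error::Error>> {
        // Read the whole .safetensors file; a production loader would
        // likely memory-map large files instead.
        let buffer = fs::read("model.safetensors")?;

        // Parse the JSON header and get zero-copy views of each tensor.
        let st = SafeTensors::deserialize(&buffer)?;
        for (name, view) in st.tensors() {
            println!("{name}: dtype={:?} shape={:?}", view.dtype(), view.shape());
        }
        Ok(())
    }

Because the header is parsed up front and tensor data is referenced in place, shapes and offsets can be bounds-checked before any tensor bytes are touched.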
This bridges the gap between HuggingFace's model ecosystem and lightweight local deployment. You can now grab any SafeTensors model from HuggingFace and run it locally with just a single binary.
GitHub: https://github.com/Michael-A-Kuykendall/shimmy
Install: `cargo install shimmy`
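If you want to poke at the OpenAI-compatible endpoint from Rust, here's a hedged sketch using `reqwest` and `serde_json` (needs `tokio` and reqwest's `json` feature). The base URL and model name are placeholders and the route assumes the standard OpenAI `/v1/chat/completions` path; check the README for the actual defaults:

    use serde_json::json;

    #[tokio::main]
    async fn main() -> Result<(), Box<dyn std::error::Error>> {
        // Placeholder host/port and model id: substitute whatever your
        // running shimmy instance actually uses.
        let body = json!({
            "model": "my-local-model",
            "messages": [{ "role": "user", "content": "Say hello in one sentence." }]
        });

        let resp = reqwest::Client::new()
            .post("http://localhost:11435/v1/chat/completions")
            .json(&body)
            .send()
            .await?
            .text()
            .await?;

        println!("{resp}");
        Ok(())
    }

Any existing OpenAI client library should work the same way by pointing its base URL at the local server.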
Happy to answer questions about the SafeTensors implementation or Rust AI inference in general!
somesun•4mo ago
I will try it. Does it support GPU acceleration, e.g. CUDA?
One question: can I use it as a library in my Rust project, or can I only call it by spawning a new process with the exe?
MKuykendall•4mo ago
Rust library: Absolutely! Add `shimmy = { version = "0.1.0", features = ["llama"] }` to your Cargo.toml and use the inference engine directly:

    let engine = shimmy::engine::llama::LlamaEngine::new();
    let model = engine.load(&spec).await?;
    let response = model.generate("prompt", opts, None).await?;
No need to spawn processes - just import and use the components directly in your Rust code.