Show HN: Integrating Local Open LLMs (LLM-Jp) with MLflow Prompt Engineering UI

https://github.com/suzuki-2001/mlflow-llm-jp-integration
1•ss-13•1y ago

Comments

ss-13•1y ago
I’ve been experimenting with MLflow’s Prompt Engineering UI, which lets you do no-code prompt tuning across multiple LLMs. While it officially supports providers such as OpenAI out of the box, I wanted to try it with Japanese open-source models from the LLM-jp project.

This repo shows how to serve these models locally using MLflow’s pyfunc model interface, expose them via the MLflow AI Gateway, and compare prompt performance through the UI.

It includes a working setup with:
- Hugging Face LLM-jp models (e.g. llm-jp-3-3.7b-instruct)
- MLflow Model Serving
- MLflow Gateway
- Prompt Engineering UI
- Streamlit UI for experiment tracking

GitHub: https://github.com/suzuki-2001/mlflow-llm-jp-integration
Japanese article explaining the project: https://zenn.dev/shosuke_13/articles/21d304b5f80e00
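
For readers unfamiliar with the pyfunc interface mentioned above, here is a minimal sketch of what wrapping a local text generator as an MLflow pyfunc model can look like. It is not taken from the repo: the class name `LocalChatModel`, the `prompt` input column, and the stub generator `echo_generate` are all hypothetical, and a real setup would presumably call a Hugging Face LLM-jp model where the stub is. The fallback base class only exists so the sketch runs without MLflow installed.

```python
from typing import Callable

try:
    from mlflow.pyfunc import PythonModel
except ImportError:
    # Fallback so the sketch can be run without MLflow installed.
    class PythonModel:  # type: ignore[no-redef]
        pass


class LocalChatModel(PythonModel):
    """Hypothetical wrapper: exposes a text-generation callable as a pyfunc model."""

    def __init__(self, generate: Callable[[str], str]):
        # `generate` is an assumption: any function mapping a prompt to a reply.
        self._generate = generate

    def predict(self, context, model_input, params=None):
        # Assumes a "prompt" column; indexing like this works for a dict of
        # lists as well as for a pandas DataFrame.
        prompts = list(model_input["prompt"])
        return [self._generate(p) for p in prompts]


def echo_generate(prompt: str) -> str:
    # Stand-in for a real model call (e.g. a locally loaded LLM-jp checkpoint).
    return f"[stub reply to] {prompt}"


model = LocalChatModel(echo_generate)
replies = model.predict(None, {"prompt": ["Hello", "こんにちは"]})
print(replies)
```

A model object like this is the kind of thing that can then be logged and served with MLflow, with the gateway pointed at the served endpoint; the exact configuration used in the repo may differ.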

Show HN: Matrix Engine WGPU – focus on mobile

https://maximumroulette.com/apps/webgpu/examples.html
1•zla•54s ago•0 comments

Ask HN: What is the meaning of life? Why are we here?

1•chistev•2m ago•0 comments

21-year-old Stanford grad raised $11M to put a hormone lab on your wrist

https://fortune.com/2026/06/17/clair-khosla-anne-wojcicki-wearble-oura-whoop-hormone-women-health/
1•brandonb•2m ago•0 comments

Anthropic: First AI startup in Frontier carbon removal coalition

https://techcrunch.com/2026/06/17/anthropic-becomes-first-ai-startup-to-join-the-frontier-carbon-...
2•mchusma•3m ago•1 comments

Full duplex for dummies – how Moshi implemented full duplex

https://www.frisson-labs.com/moshi-for-mere-mortals
1•ymaws•6m ago•1 comments

Economist on How High Inflation Takes Time to Build Up [video]

https://www.youtube.com/watch?v=7iYnIjcu9l8
1•mooreds•9m ago•0 comments

How trust funds made the modern world

https://springbett.substack.com/p/in-progress-we-trust
1•cainxinth•11m ago•0 comments

Mathematics Dating Simulator

https://twitter.com/akuicia/status/2067096575172899015
2•enthdegree•12m ago•0 comments

The Generative AI Learning Penalty: Evidence from Chinese Secondary Education

https://cepr.org/publications/dp21577
1•obscurette•12m ago•0 comments

Programmers at Work

https://www.programmersatwork.net
1•rbanffy•12m ago•0 comments

Five Things We Learned from Warsh's First Fed Meeting

https://universwhat.blogspot.com/2026/06/five-things-we-learned-from-warshs.html
1•Reset_freeze•15m ago•1 comments

Cwmail – a fast, keyboard-driven terminal email client in Go, AI-powered replies

https://mail.intellios.ai/
1•coolwulf•15m ago•0 comments

Apple to Raise Prices Due to Memory Chip Crunch

https://www.wsj.com/tech/apple-price-increases-memory-supply-199845b1
4•foobarqux•16m ago•0 comments

ToolSchema Kit – reproducible MCP contract drift lab (10-min tutorial)

https://github.com/kioie/toolschema-kit
1•driftguard•17m ago•0 comments

AI medical tools match or surpass doctors for advice

https://www.ft.com/content/734a45ee-86c4-47e1-8323-569bc14dcdd7
2•aanet•17m ago•1 comments

American Brahman Facts for Kids

https://kids.kiddle.co/American_Brahman
1•kamaraju•18m ago•0 comments

Only half of US datacenter capacity planned is under construction

https://www.theregister.com/on-prem/2026/06/17/only-half-of-us-datacenter-capacity-planned-for-20...
4•Bender•19m ago•1 comments

A Town Square for my site

https://cauenapier.com/blog/townsquare/
1•tlocrt•19m ago•0 comments

Did u get the transmission that a company called Cerebras converted to a dog...

1•cryptcrsswrd•20m ago•0 comments

Tesco moving 40k server workloads off VMware amid Broadcom's abusive conduct

https://arstechnica.com/information-technology/2026/06/tesco-moving-40000-server-workloads-off-vm...
4•Bender•21m ago•0 comments

California says AT&T lied to FCC in attempt to shut off old phone network

https://arstechnica.com/tech-policy/2026/06/california-says-att-lied-to-fcc-in-attempt-to-shut-of...
2•Bender•21m ago•0 comments

A Robot Is Sprinting Towards You: Do You Want It Running on Claude or Grok?

https://openrouter.ai/blog/insights/royale-last-agent-standing/
6•Usu•21m ago•0 comments

Odyssey $310M Fundraise to Accelerate World Simulation

https://odyssey.ml/our-series-b
2•ilreb•22m ago•0 comments

AI agent hiring Human to ship ugc videos (at AI price)

https://klaylab.com/
1•zya_wei•22m ago•1 comments

Show HN: ESP32 512kB – Tailscale, English to Python LLM and 8 containers local

https://punnerud.github.io/pyspell/
1•punnerud•23m ago•0 comments

Congress has more power than it thinks

https://www.lawfaremedia.org/article/congress-has-more-power-than-it-thinks
2•softwaredoug•28m ago•0 comments

Stephen Kotkin: Can America Still Lead the World? [video]

https://www.youtube.com/watch?v=gBEdxb8ei_0
2•simonebrunozzi•29m ago•0 comments

The $3 Bag That Broke the Internet

https://universwhat.blogspot.com/2026/06/the-3-bag-that-broke-internet-forum.html
1•Reset_freeze•30m ago•1 comments

Ask HN: How to deal with UI within the agentic loop

2•mattsadowsky•33m ago•1 comments

Model Card: unsloth/GLM-5.2-GGUF

https://huggingface.co/unsloth/GLM-5.2-GGUF
1•recroad•34m ago•0 comments