frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Integrating Local Open LLMs (LLM-Jp) with MLflow Prompt Engineering UI

https://github.com/suzuki-2001/mlflow-llm-jp-integration
1•ss-13•1y ago

Comments

ss-13•1y ago
I’ve been experimenting with MLflow’s Prompt Engineering UI, which lets you do no-code prompt tuning across multiple LLMs. While it officially supports models like OpenAI out of the box, I wanted to try it with Japanese open-source models from the LLM-jp project.

This repo shows how to serve these models locally using MLflow’s pyfunc model interface, expose them via the MLflow AI Gateway, and compare prompt performance through the UI.

It includes a working setup with: - Hugging Face LLM-jp models (e.g. llm-jp-3-3.7b-instruct) - MLflow Model Serving - MLflow Gateway - Prompt Engineering UI - Streamlit UI for experiment tracking

GitHub: https://github.com/suzuki-2001/mlflow-llm-jp-integration Japanese article explaining the project: https://zenn.dev/shosuke_13/articles/21d304b5f80e00

Spontaneous Human Combustion over the Ages

https://www.fatemag.com/post/spontaneous-human-combustion-over-the-ages
1•austinallegro•54s ago•0 comments

APL Performance

https://aplwiki.com/wiki/Performance
1•tosh•4m ago•0 comments

You can't whisper at an AI agent

https://stripe.dev/blog/ai-steering-experiments
1•logickkk1•4m ago•0 comments

The vibes in SF feel frenetic – The divide in outcomes is massive

https://twitter.com/deedydas/status/2055491938464489888
1•spenvo•5m ago•0 comments

Polymarket Isn't a Prediction Market for the Real World

https://reticulating.substack.com/p/polymarket-isnt-a-prediction-market
1•fontain•9m ago•0 comments

Mike Stonebraker: Computer Science might not be a growth industry going forward

https://www.youtube.com/watch?v=YPObBOwIrHk
1•tcp_handshaker•10m ago•1 comments

Texas county passes 1-year data center construction ban

https://www.politico.com/news/2026/05/16/texas-county-data-center-construction-ban-00922493
2•cdrnsf•12m ago•0 comments

Beaver: An Enterprise Benchmark for Text-to-SQL

https://peterbaile.github.io/beaver/
1•tcp_handshaker•14m ago•0 comments

Rx Inspector: Look Up Where Your Generic Prescription Drugs Were Made

https://projects.propublica.org/rx-inspector/
3•turtleyacht•19m ago•0 comments

The 48-Hour Cancer Binder

https://ludocomito.dev/blog/48-hour-cancer-binder/
2•ludocomito•21m ago•0 comments

Palace-AI – memory palace for AI agents

https://github.com/RhyChaw/palace-ai
3•RhythmC•22m ago•0 comments

A Note On Distributed Computing (1994)[pdf]

https://waldo.scholars.harvard.edu/sites/g/files/omnuum6261/files/waldo/files/waldo-94.pdf
2•ai_critic•32m ago•0 comments

The clean-up cost of AI code is what the velocity narrative leaves out

https://webflow.com/blog/cleanup-cost-ai-generated-code
4•dxs•33m ago•0 comments

Halt and Catch Fire

https://unstack.io/halt-and-catch-fire
4•ScottWRobinson•36m ago•0 comments

Tata Electronics and ASML Announce Strategic Partnership

https://www.asml.com/en/news/press-releases/2026/tata-electronics-and-asml-announce-strategic-par...
3•saharshpruthi•38m ago•1 comments

As Russian drones hunt Ukrainians in 'human Safari,' a boy fought back

https://www.washingtonpost.com/world/2026/05/15/russian-drones-hunt-ukrainians-human-safari-boy-f...
4•Stratoscope•38m ago•1 comments

Thousands todo apps, but none allows one-click collaboration with my grandma

https://medium.com/@tombarys/dotolist-creating-a-one-click-team-065a18dbeccd
3•tombarys•40m ago•1 comments

Elevated error rates on requests to multiple models

https://status.claude.com/incidents/v9s6d0jt84hj
9•recroad•41m ago•2 comments

Show HN: Daily vibe-coding video games, day 33: Tower Defense (single prompt)

https://gamevibe.us/33-tower-defense
4•pzxc•41m ago•0 comments

CC: Anthropic API Error: 500 Internal Server Error

https://github.com/anthropics/claude-code/issues/59743
5•brenoRibeiro706•43m ago•1 comments

Nvidia Preparing RTX 5090 Price Increase Due to Rising GDDR7 Memory Costs

https://mp1st.com/news/nvidia-preparing-rtx-5090-price-increase
3•jeffufl•43m ago•0 comments

Feeds: A Minimal RSS Aggregator and Client

https://stevedylan.dev/posts/feeds/
3•speckx•45m ago•0 comments

Malta gives citizens a paid version of ChatGPT Plus for free

https://ranked.news/malta-gives-citizens-a-paid-version-of-chatgpt-plus-for-free
2•doener•47m ago•0 comments

I 3D Printed Origami [video]

https://www.youtube.com/watch?v=FNVBK7-h9Fs
7•Teever•49m ago•1 comments

Haiku boots to desktop on an M1 MacBook Air

https://discuss.haiku-os.org/t/my-haiku-arm64-progress/19044?page=2
3•calgarymicro•51m ago•1 comments

Hermes-agentmemory: pull-model episodic memory with real deletes

https://github.com/MukundaKatta/hermes-agentmemory
2•mukundakatta•52m ago•0 comments

US Is Starting to See Heavy Job Losses in Roles Exposed to AI

https://www.bloomberg.com/news/articles/2026-05-15/us-is-starting-to-see-heavy-job-losses-in-role...
23•elsewhen•53m ago•3 comments

A brief guide to self-hosting websites and apps using Cloudflare Tunnel

https://blog.dougbelshaw.com/cloudflare-tunnel/
2•speckx•56m ago•0 comments

Rubin Tracks Skyscraper-Size Asteroids, Supernovas, and Interstellar Visitors

https://www.quantamagazine.org/rubin-tracks-skyscraper-size-asteroids-failed-supernovas-and-inter...
1•rbanffy•1h ago•0 comments

OpenIQ - Building a product engineering muscle in the age of agents

https://abhirame.github.io/posts/openiq/
2•abhis3798•1h ago•0 comments