frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Integrating Local Open LLMs (LLM-Jp) with MLflow Prompt Engineering UI

https://github.com/suzuki-2001/mlflow-llm-jp-integration
1•ss-13•1y ago

Comments

ss-13•1y ago
I’ve been experimenting with MLflow’s Prompt Engineering UI, which lets you do no-code prompt tuning across multiple LLMs. While it officially supports models like OpenAI out of the box, I wanted to try it with Japanese open-source models from the LLM-jp project.

This repo shows how to serve these models locally using MLflow’s pyfunc model interface, expose them via the MLflow AI Gateway, and compare prompt performance through the UI.

It includes a working setup with: - Hugging Face LLM-jp models (e.g. llm-jp-3-3.7b-instruct) - MLflow Model Serving - MLflow Gateway - Prompt Engineering UI - Streamlit UI for experiment tracking

GitHub: https://github.com/suzuki-2001/mlflow-llm-jp-integration Japanese article explaining the project: https://zenn.dev/shosuke_13/articles/21d304b5f80e00

Normetrics: A unified API for norm-based linear models (white paper)

https://github.com/PPENELLE/normetrics
1•ppenelle•31s ago•0 comments

Show HN: Junco, turn newsletters into short audio episodes

https://www.tryjunco.com/
1•alex-onecard•42s ago•0 comments

Generating Random Factored Numbers, Easily [pdf]

https://link.springer.com/content/pdf/10.1007/s00145-003-0051-5.pdf
1•luu•1m ago•0 comments

DeepSeek-V4-Flash (official FP8) running across 2x DGX Spark

https://forums.developer.nvidia.com/t/deepseek-v4-flash-official-fp8-running-across-2x-dgx-spark-...
1•pilooch•1m ago•0 comments

FBI charges two NIH researchers with smuggling monkeypox to US from Congo

https://www.justice.gov/usao-edmi/pr/feds-charge-foreign-nationals-working-national-institutes-he...
1•delichon•2m ago•0 comments

Python The Good Stuff: Humble Book Bundle

https://www.humblebundle.com/books/python-good-stuff-no-starch-books
1•teleforce•2m ago•0 comments

Use your Nvidia GPU's VRAM as swap space on Linux

https://github.com/c0dejedi/nbd-vram
2•tanelpoder•5m ago•0 comments

FullPAC files S-1 [pdf]

https://d1io3yog0oux5.cloudfront.net/gotv/sec/0001493152-26-026911/0001493152-26-026911.pdf
1•naryJane•6m ago•1 comments

Always Be Blaming: how Git blame answers the wrong question

https://matklad.github.io/2026/05/18/always-be-blaming.html
1•pgedge_postgres•11m ago•0 comments

Show HN: Reloops – Open-Source Frame.io Alternative for AI Agents and Teams

https://github.com/Reloops-App/reloops/
1•dheerajbhatia27•11m ago•0 comments

Show HN: Ordinary and Ordinaryd v0.6.0

https://codeberg.org/ordinarylabs/Ordinary/src/branch/main/docs/quick-start.md
1•seanwatters•12m ago•0 comments

Feds failing in bid to take a supercomputer from a climate research center

https://arstechnica.com/science/2026/06/judge-blocks-part-of-trump-admins-effort-to-hurt-colorado...
1•yodon•13m ago•0 comments

I hadn't coded in 30 years. Then I built a space game with Godot

2•CosmicGoldRush•14m ago•0 comments

AI enthusiasts are in race against time, AI skeptics are in race against entropy

https://charitydotwtf.substack.com/p/ai-enthusiasts-are-in-a-race-against
1•wapasta•14m ago•0 comments

Eupago for Python – The First Python SDK for Portugal's MB Way/Multibanco

https://github.com/bilouro/eupago-python
1•bilouro•16m ago•0 comments

This creepy blob robot will keep going even if you break its legs

https://www.popsci.com/technology/unstoppable-blob-robot/
1•mhb•21m ago•1 comments

Law Professors Prefer AI over Peer Answers [pdf]

https://law.stanford.edu/wp-content/uploads/2026/06/salinas_et_al.pdf
1•droidjj•23m ago•0 comments

Titan Network claims 5% of Asia's AI data market using crowdsourced home devices

https://www.coindesk.com/tech/2026/06/02/here-s-how-one-decentralized-cloud-provider-says-private...
1•Reaktornano•24m ago•1 comments

Paseo – Beautiful open-source coding agent interface (desktop, mobile, CLI)

https://github.com/getpaseo/paseo
3•timhigins•26m ago•1 comments

The Empty Field That Wasn't: GPS, OTAD and Two Decades of Encrypted Broadcasts

https://lsc-pagepro.mydigitalpublication.com/publication/?i=865273&p=62&view=issueViewer
1•ahlCVA•27m ago•0 comments

WinUtils: Shell-powered CLI tools for Windows 95

https://www.codenaked.com/winutils
2•code_naked•27m ago•1 comments

We tore down our no-code site and went back to code

https://twitter.com/chrismuccioli/status/2061909833893257389
4•nadis•28m ago•1 comments

ContextWall – Context firewall for AI agents and RAG pipelines

https://contextwall.io/
2•sumeshpk•29m ago•0 comments

Show HN: Scholar Sidekick – citation verifier for the "real DOI, wrong paper"

https://scholar-sidekick.com
1•ProductivePhys•30m ago•1 comments

Dense Retrievers Know More Than They Can Express

https://www.mixedbread.com/blog/latent-terms
1•emschwartz•31m ago•0 comments

Android gets fake-call detection for spoofed calls

https://arstechnica.com/gadgets/2026/06/google-announces-deepfake-call-detection-for-android-new-...
1•Lihh27•33m ago•0 comments

Understand your AI generated code

https://archtocode.com/blog
1•grzelazny•37m ago•0 comments

4K years ago, Mohenjo-daro grew more equal over time

https://archaeologymag.com/2026/05/mohenjo-daro-grew-more-equal-over-time/
3•marojejian•38m ago•0 comments

A harness for every task: dynamic workflows in Claude Code

https://claude.com/blog/a-harness-for-every-task-dynamic-workflows-in-claude-code
2•cebert•38m ago•0 comments

JavaScript Crossword

https://lyra.horse/fun/jscrossword/
2•patrikcsak•40m ago•0 comments