Hey HN! I built Skills-Kit, a TypeScript framework that lets you create, validate, and bundle self-contained "skills" – think of them as portable automation modules that AI agents (or humans) can execute.
The Problem: Most AI agent frameworks treat code execution as an afterthought. You get either sandboxed-but-limited environments or full system access with zero safety. Plus, sharing and versioning agent capabilities is a mess.
Skills-Kit's approach:
Each skill is a folder: metadata (YAML), a deterministic Node.js entrypoint, declarative security policies, and golden tests (see the sketch after this list)
Built-in linting validates structure and security declarations
Golden test runner ensures skills behave correctly
AI-powered creation: Use Claude (or mock templates) to generate skills from natural language
Bundle and distribute skills as validated packages
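
To give a flavor of the layout, here's a simplified sketch of a skill folder and its entrypoint. The file names and the run() signature are illustrative rather than an exact spec:

    my-skill/
      skill.yaml      # metadata: name, version, description, inputs/outputs
      policy.yaml     # declared capabilities: network, filesystem, exec
      index.ts        # deterministic entrypoint
      tests/golden/   # input -> expected-output fixtures

    // index.ts – hypothetical entrypoint shape. Deterministic: same input,
    // same output, which is what makes golden tests meaningful.
    export interface Input { name: string }
    export interface Output { greeting: string }

    export async function run(input: Input): Promise<Output> {
      return { greeting: `Hello, ${input.name}!` };
    }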
What makes it interesting:
Security-first: skills declare what they need (network, filesystem, exec) upfront via policy.yaml
Testable: golden tests catch regressions before deployment
Provider-agnostic: works with Anthropic's API today, designed to support other LLMs
Composable: skills can call other skills (orchestration primitives – see the sketch after this list)
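
And to make the composability point concrete, a simplified sketch of a skill that delegates to other skills. The ctx.invoke helper shown here is illustrative, not the exact interface:

    // Hypothetical orchestration shape: a skill that calls other skills by name.
    type InvokeSkill = (skill: string, input: unknown) => Promise<unknown>;

    export async function run(
      input: { url: string },
      ctx: { invoke: InvokeSkill }
    ): Promise<{ summary: string }> {
      // "fetch-page" would declare network access in its policy.yaml;
      // "summarize-text" would need no network at all.
      const page = await ctx.invoke("fetch-page", { url: input.url });
      const summary = await ctx.invoke("summarize-text", { text: page });
      return { summary: String(summary) };
    }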
Current state: early (v0.1.0), and interfaces may evolve. Looking for feedback on:
The skill format itself – too verbose? missing something critical?
Security model – how would you enforce policies at runtime?
Use cases I'm missing – what would you build with this?
I'm not running a hosted service (yet?) – this is CLI/library tooling you run locally. The goal is to make "agentic capabilities" as shareable and reliable as npm packages.
GitHub: https://github.com/gabrielekarra/skills-kit
Would love to hear what you think, especially from folks building agent systems. What's your experience with code generation and execution safety?