Show HN: Agent Skills Leaderboard

skills.sh

131 points

andrewqu

2 days ago


44 comments

dave1010uk a day ago

The install is very opaque. It's not clear where these skills are installed, how to upgrade them or remove them.

Here's the `skills` package on NPM: https://www.npmjs.com/package/skills - it's MIT licensed but I can't find it on Github.

`skills` looks to be a wrapper around `add-skill`: https://github.com/vercel-labs/add-skill

From the docs, `add-skill` auto detects from 16 different potential paths to copy skills to in a repository (.claude/, .codex/, .Gemini/, etc).

`add-skill` also let's you install skills globally (~/). From the code, `skills` looks like it doesn't support global installs but under the hood it passes all args to add-skill, so you should be able to install skills globally or install multiple skills (even if the wrapper doesn't expect it).

Aside: although lots of agents have adopted SKILLS.md conventions, they're currently all using their own paths. There doesn't seem to be a consensus yet, like there is with AGENTS.md. There are even 3 generic paths: .agent/skills/, .agents/skills/ and just skills/

  • _august a day ago

    Although it's not clear how to upgrade them (I doubt there is any version management built-in), the installer does specify where it will be installed (And lets you choose global/project level.

  • Traubenfuchs a day ago

    Skills certainly got a wildwest pre VCS copy-paste feeling going on.

laborcontract 2 days ago

As someone who has found skills useful, seeing skills like this[0] raises the same question about (a subset of) skills as did MCP: why not just have the agent run ‘tool --help’?

https://skills.sh/ubie-inc/agent-skills/codex

  • zbyforgotp a day ago

    The point is for the agent to have an index of available skills so that it can decide autonomously if it needs to load one.

    https://zzbbyy.substack.com/p/what-are-skills

    In my experience it doesn’t work too well with codex, but I expect llm providers to train them on that use case and improve the situation soon.

  • solumunus a day ago

    Not all flags will be relevant, you can trim down and contextualise the instructions for your specific project.

straydusk 2 days ago

What is this? How does it work? How are skills ranked? Seems a little bit fishy to me that you can only tell it's from Vercel if you click the top left corner, and the top two skills come from vercel... despite there definitely being much more used skills in the overall AI coding ecosystem.

The UI looks nice, otherwise. I had thought about building something like this - maybe this just increases my confidence that this is needed, just not affiliated with a company.

  • andrewqu a day ago

    skills are ranked by anonymous telemetry from running `npx skills add <owner/repo>`

    Vercel's skills are popularly installed because we initially launched `npx skills` with the launch of our `react-best-practices`

    But have been developing the tool in tandem!

  • embedding-shape 2 days ago

    > How are skills ranked?

    By npm weekly installs (??). Famously good signal for quality.

    Edit: Not even npm, their own tools download count...

amadeuswoo a day ago

Honest question: has anyone found skills that fundamentally changed their workflow vs. ones that are just ‘nice to have’? Curious what the actual power-user stack looks like.

Anyways, great work on this btw, the agent-agnostic approach is the right call

  • theshrike79 3 hours ago

    Basically skills are /commands that can have attached scripts, that's about it.

    If your "skill" doesn't come with scripts/executables, it's just a fancy slash command.

    I've had success with code quality analysis skills that use PEP723 Python scripts with tree-sitter to actually analyse the code structure as an AST. The script output and switches are specifically optimised for LLM use, no extra fluff - just condensed content with the exact data needed.

  • zby a day ago

    I would double - do skills reliably work for you? I mean are they reliably injected when there is a need, as opposed to being actively called for (which in my opinion defeats the purpose of skills - because I can always ask the llm to read a document and then do something with the new knowledge).

    I have a feeling that codex still does not do it reliably - so I still have normal README files which it loads quite intelligently and it works better than the discovery via skills.

  • esperent a day ago

    > power-user stack

    Try installing the Claude Superpowers skills - you can install them one by one from here, but it's easier to install the superpowers plugin. Try using it for a couple of sessions and see how it works for you.

    For a full test, try starting with the brainstorming one which then guides you from brainstorming though planning, development etc.

    I've been using it for a few days and I would say it's enhanced my workflows at least.

  • NitpickLawyer a day ago

    One simple but useful flow is to ask cc to review a session and find miss-matches between initial skills / agent.md and current session, and propose an edit. I then skim over it and add it. It feels like it helps, but I don't have quantitative data yet.

  • flwi a day ago

    Define "fundamental", but I added skills to run complicated evaluation procedures for my ML research. This way I can open 5 CC instances, let them run and iterate on research without intervention. After that, I can do the final review.

  • rudedogg a day ago

    My experience with them is limited, but I’m having issues with the LLMs ignoring the skills content. I guess it makes sense, it’s like any other piece of context.

    But it’s put a damper in my dream of constraining them with well crafted skills, and producing high quality output.

    • amadeuswoo a day ago

      Yeah, the context window is a blunt instrument, everything competes for attention. I get better luck with shorter, more opinionated skills that front-load the key constraints vs. comprehensive docs that get diluted. Also explicitly invoking them (use the X skill) seems to help vs hoping they get picked up automatically

      • SOLAR_FIELDS a day ago

        Yes, unfortunately the most reliable way is to inject them into the user prompt at a fresh session. My guess is that biasing towards checking for the tools availability too much affects performance, which might explain why it is quite rarer that I see it just choose to use a skill without previous prompting.

    • KingMob a day ago

      Yeah, I'm still trying to figure out why some skills are used every day, while others are constantly ignored. I suspect partially overlapping skill areas might confuse it.

      I've added a UserPromptSubmit hook that does basic regex matches on requests, and tries to interject a tool suggestion, but it's still not foolproof.

saberience a day ago

So this is a page made by Vercel which conveniently puts the Vercel skill as top of the list.

Nice.

The page should properly advertize that this is a corporate built/sponsored site so people can understand that it's totally biased.

Johnny_Bonk a day ago

Nice work! I don't think Vercel is the first to do this, but it's a good idea and I'm glad to see more players in this space.

A small UI suggestion: it would be helpful if hovering on a row showed the skill description, along with a button to copy the install command.

For anyone interested, there are two other sites already doing something similar:

- claudemarketplaces.com - A comprehensive directory with 1900+ marketplaces, shows descriptions directly in the list view with copy-to-install commands

- skillsmp.com - Has 77K+ skills indexed from GitHub. Cool developer-style UI, but honestly the UX could use work—the search is hidden behind cryptic command-style buttons and it's not obvious how to actually search

Also worth checking out the Claude Code Mastery guide (thedecipherist.github.io/claude-code-mastery) for a deeper dive into skills, hooks, MCP, and CLAUDE.md.

joshribakoff a day ago

This is a really good implementation, but I don’t lean too heavily into skills especially not other people‘s. If I’m doing design who’s to say I want instructions in there in the first place like “pick an extreme“ (instructions in the design skill featured on the homepage)

m-hodges a day ago

Why do none of these “npm for Skills” document any way to do basic package management things like updates, version-pinning, or even uninstalls?

  • giancarlostoro a day ago

    ;) because you can ask Claude to do all of those things, but which package manager? there's a dozen ways to install packages, will Claude pick the right one you wanted? Your skill will guarantee consistency

    • m-hodges a day ago

      I don’t understand this reply at all. If I use this skills.sh tool to install the Foo Skill for one or more agent harness, how do I update Foo later?

      > but which package manager?

      The one being linked to.

      • giancarlostoro 21 hours ago

        I misunderstood your question, thought you were confused about a skill to enforce NPM usage by Claude to use NPM strictly as opposed to another package manager. Last time I post on HN past midnight.

toledocavani 2 days ago

The leaderboard is ranked by the weekly download count by their "npx skills" command. This is Vercel new "standard" skills installer so obvious their skills are at the top.

testfrequency a day ago

I wish I knew why my skills are never called…including my custom sub agents.

Maybe it’s my own ignorance, but Claude loves to ignore its CLAIDE.MD which says it’s mandatory to leverage sub agents to delegate tasks and use skills for accomplish specific workflows.

Every time I call Claude out it tells me it knows and chose to ignore it, even going as far as saying it’s not my decision.

Any tips?

  • chewz a day ago

    Create a hook that would ask Claude Code to evaluate all skills in the project and decide which are applicable to the current task at hand. It is easy and works very well.

    Forget Claude.md

    • testfrequency a day ago

      I can’t thank you enough for course correcting me here, this is exactly what I should have been doing. I did some reading and I clearly wasn’t using any of the features correctly or efficiently.

      My Claude.md was nearly 1,900 lines. It’s down to 150 lines now with Skills fully built out for the agents, and a hook to steer the ship. It’s all working perfectly now.

      Thank you again!

      • chewz 14 hours ago
        2 more

        You are welcome. I have myself picked up this trick reading through Reditt and later polished that myself. No official guide is suggesting that

        • testfrequency 12 hours ago

          Now that it’s set up it feels obvious, but I agree nowhere was this clear to me that this is how skills, hooks, and subagents were intended to be configured.

          I was so much more productive today and used far less tokens. Thank you again!