Agentic AI Atlasby a5c.ai
OverviewWikiGraphFor AgentsEdgesSearchWorkspace
/
GitHubDocsDiscord
i.5Wiki
Agentic AI Atlas · huggingface/skills
docs/reference-repos/huggingface/skills/researcha5c.ai
Search the atlas/
Wiki · linked records

Article and nearby pages

I.Current articlepp. 1 - 1
I.
Wiki article

docs/reference-repos/huggingface/skills/research

Reading · 5 min

huggingface/skills reference

Hugging Face's official skill collection for AI/ML development workflows, compatible with Claude Code, Codex, Gemini CLI, and Cursor. Contains 11 specialized skills covering the complete ML pipeline: datasets, training (LLM and vision), evaluation, Gradio apps, paper publishing, tool building, and deployment. Follows the standardized Agent Skills format with multi-harness plugin support and includes MCP server integration.

Page nodewiki/docs/reference-repos/huggingface/skills/research.mdNearby pages · 0Documents · 0

huggingface/skills

  • **Archetype**: mega-skill-pack
  • **Stars**: 10,166
  • **Last pushed**: 2026-04-13
  • **License**: Apache-2.0
  • **Discovered**: 2026-04-13
  • **Source**: backlog-processing
  • **Skills found**: 11 (hf-cli, huggingface-community-evals, huggingface-datasets, huggingface-gradio, huggingface-llm-trainer, huggingface-paper-publisher, huggingface-papers, huggingface-tool-builder, huggingface-trackio, huggingface-vision-trainer, transformers-js)

Summary

Hugging Face's official skill collection for AI/ML development workflows, compatible with Claude Code, Codex, Gemini CLI, and Cursor. Contains 11 specialized skills covering the complete ML pipeline: datasets, training (LLM and vision), evaluation, Gradio apps, paper publishing, tool building, and deployment. Follows the standardized Agent Skills format with multi-harness plugin support and includes MCP server integration.

Assessment

HIGH VALUE. This is an authoritative mega-skill-pack from Hugging Face containing production-grade ML workflows. The skills encode detailed procedural knowledge for complex tasks like model training (SFT, DPO, GRPO, reward modeling), dataset preparation and validation, evaluation pipeline setup, and Gradio app deployment. The training skills particularly contain sophisticated infrastructure management, cost estimation, and monitoring procedures that are directly extractable as specializations/data-science-ml/ processes.

Extraction Priority

HIGH - Contains official Hugging Face workflows that are directly transferable:

  • ML model training pipelines -> specializations/data-science-ml/
  • Dataset preparation and validation workflows -> specializations/data-science-ml/
  • Model evaluation and benchmarking processes -> specializations/data-science-ml/
  • Gradio app development workflows -> specializations/frontend/ or specializations/data-science-ml/

Skills Inventory

SkillPathDomainTransferable?Notes
huggingface-llm-trainerskills/huggingface-llm-trainer/ML/DataYes - processTRL training methods (SFT, DPO, GRPO), infrastructure management
huggingface-datasetsskills/huggingface-datasets/ML/DataYes - processDataset preparation, validation, Hub integration
huggingface-gradioskills/huggingface-gradio/Frontend/MLYes - processInteractive ML app development and deployment
huggingface-community-evalsskills/huggingface-community-evals/ML/DataYes - processModel evaluation and benchmarking workflows
huggingface-vision-trainerskills/huggingface-vision-trainer/ML/DataYes - processComputer vision model training pipelines
huggingface-paper-publisherskills/huggingface-paper-publisher/ML/AcademicYes - patternResearch paper publishing and model documentation
transformers-jsskills/transformers-js/Frontend/MLYes - processBrowser-based ML model deployment workflows

Processes

- Source: skills/huggingface-llm-trainer/SKILL.md (lines 10-50) - Placement: specializations/data-science-ml/ - Inputs: Training data, model configuration, hardware requirements - Outputs: Trained model, training metrics, deployment artifacts - Complexity: complex - Notes: Covers SFT, DPO, GRPO, reward modeling, cost estimation, monitoring

  • **ml-model-training-pipeline**: Comprehensive workflow for training language models using TRL on Hugging Face infrastructure

- Source: skills/huggingface-datasets/SKILL.md - Placement: specializations/data-science-ml/ - Inputs: Raw data, schema requirements, quality criteria - Outputs: Validated dataset, metadata, quality report - Complexity: moderate

  • **dataset-preparation-workflow**: Systematic process for preparing and validating datasets for ML training

- Source: skills/huggingface-gradio/SKILL.md - Placement: specializations/data-science-ml/ - Inputs: Model, UI requirements, deployment target - Outputs: Interactive app, deployment configuration, user documentation - Complexity: moderate

  • **gradio-app-development**: End-to-end process for building interactive ML applications with Gradio

- Source: skills/huggingface-community-evals/SKILL.md - Placement: specializations/data-science-ml/ - Inputs: Model, evaluation datasets, benchmark criteria - Outputs: Performance metrics, comparison reports, leaderboard submissions - Complexity: moderate

  • **model-evaluation-pipeline**: Systematic approach to evaluating and benchmarking ML models

Plugin Ideas

- What install.md would do: Set up Gradio development environment, create app templates, configure deployment pipelines, install UI components - Processes it would copy: gradio-app-development, ml-model-integration - Configs/hooks it would create: Gradio templates, CSS themes, deployment scripts, monitoring configs - Source evidence: huggingface-gradio skill with interactive app development workflows

  • **gradio-app-builder**: Plugin for rapid ML application prototyping and deployment

Implicit Procedural Knowledge

- Source: huggingface-llm-trainer skill documentation and method comparison sections - Placement: specializations/data-science-ml/ - Why codify: Provides systematic decision framework for training method selection in ML projects - Sketch: Use case analysis -> Data type evaluation -> Method capability mapping -> Cost-performance trade-off -> Training method recommendation

  • **TRL Training Method Selection**: Process for choosing appropriate training method (SFT vs DPO vs GRPO) based on use case and data

- Source: Training skills' hardware selection and cost estimation guidance - Placement: specializations/data-science-ml/ - Why codify: Systematic approach to ML infrastructure planning that's reusable across cloud providers - Sketch: Model size analysis -> Training duration estimation -> Hardware requirements mapping -> Cost calculation -> Optimization recommendations

  • **ML Infrastructure Cost Estimation**: Process for estimating and optimizing training costs on cloud ML infrastructure

Library Mapping

Extractable ProcessLibrary StatusActionExisting PathTarget Placement
ML Model Training PipelineNEWTRL training methods (SFT, DPO, GRPO) with infrastructure management-specializations/data-science-ml/ml-model-training-pipeline.js
Dataset Preparation WorkflowNEWSystematic dataset preparation and validation for ML training-specializations/data-science-ml/dataset-preparation-workflow.js
Gradio App DevelopmentNEWEnd-to-end interactive ML application development process-specializations/data-science-ml/gradio-app-development.js
Model Evaluation PipelineNEWSystematic ML model evaluation and benchmarking methodology-specializations/data-science-ml/model-evaluation-pipeline.js
TRL Training Method SelectionNEWDecision framework for choosing SFT vs DPO vs GRPO training methods-specializations/data-science-ml/trl-training-method-selection.js
ML Infrastructure Cost EstimationNEWTraining cost estimation and optimization for cloud ML infrastructure-specializations/data-science-ml/ml-infrastructure-cost-estimation.js
Computer Vision Training PipelineNEWSpecialized training pipeline for computer vision models-specializations/data-science-ml/computer-vision-training-pipeline.js
Research Paper Publishing ProcessNEWML research paper publishing and model documentation workflow-specializations/data-science-ml/research-paper-publishing.js

Plugin Marketplace Mapping

Plugin IdeaMarketplace StatusActionExisting PluginTarget Placement
Gradio App BuilderNEWRapid ML application prototyping with templates and deployment automation-plugins/a5c/marketplace/plugins/gradio-app-builder/

Trail

Wiki
Babysitter Docs
Reference Repos

Huggingface

Skills

huggingface/skills

Page record

Open node ledger

wiki/docs/reference-repos/huggingface/skills/research.md

Documents

No documented graph nodes on this page.