Agentic AI Atlas

II.

Page JSON

page:docs-reference-repos-huggingface-skills-research

Structured · live

huggingface/skills json

Inspect the normalized record payload exactly as the atlas UI reads it.

File · wiki/docs/reference-repos/huggingface/skills/research.mdCluster · wiki

Record JSON

{
  "id": "page:docs-reference-repos-huggingface-skills-research",
  "_kind": "Page",
  "_file": "wiki/docs/reference-repos/huggingface/skills/research.md",
  "_cluster": "wiki",
  "attributes": {
    "nodeKind": "Page",
    "sourcePath": "docs/reference-repos/huggingface/skills/research.md",
    "sourceKind": "repo-docs",
    "title": "huggingface/skills",
    "displayName": "huggingface/skills",
    "slug": "docs/reference-repos/huggingface/skills/research",
    "articlePath": "wiki/docs/reference-repos/huggingface/skills/research.md",
    "article": "\n# huggingface/skills\n\n- **Archetype**: mega-skill-pack\n- **Stars**: 10,166\n- **Last pushed**: 2026-04-13\n- **License**: Apache-2.0\n- **Discovered**: 2026-04-13\n- **Source**: backlog-processing\n- **Skills found**: 11 (hf-cli, huggingface-community-evals, huggingface-datasets, huggingface-gradio, huggingface-llm-trainer, huggingface-paper-publisher, huggingface-papers, huggingface-tool-builder, huggingface-trackio, huggingface-vision-trainer, transformers-js)\n\n## Summary\nHugging Face's official skill collection for AI/ML development workflows, compatible with Claude Code, Codex, Gemini CLI, and Cursor. Contains 11 specialized skills covering the complete ML pipeline: datasets, training (LLM and vision), evaluation, Gradio apps, paper publishing, tool building, and deployment. Follows the standardized Agent Skills format with multi-harness plugin support and includes MCP server integration.\n\n## Assessment\nHIGH VALUE. This is an authoritative mega-skill-pack from Hugging Face containing production-grade ML workflows. The skills encode detailed procedural knowledge for complex tasks like model training (SFT, DPO, GRPO, reward modeling), dataset preparation and validation, evaluation pipeline setup, and Gradio app deployment. The training skills particularly contain sophisticated infrastructure management, cost estimation, and monitoring procedures that are directly extractable as specializations/data-science-ml/ processes.\n\n## Extraction Priority\nHIGH - Contains official Hugging Face workflows that are directly transferable:\n- ML model training pipelines -> specializations/data-science-ml/\n- Dataset preparation and validation workflows -> specializations/data-science-ml/\n- Model evaluation and benchmarking processes -> specializations/data-science-ml/\n- Gradio app development workflows -> specializations/frontend/ or specializations/data-science-ml/\n\n## Skills Inventory\n\n| Skill | Path | Domain | Transferable? | Notes |\n|-------|------|--------|---------------|-------|\n| huggingface-llm-trainer | skills/huggingface-llm-trainer/ | ML/Data | Yes - process | TRL training methods (SFT, DPO, GRPO), infrastructure management |\n| huggingface-datasets | skills/huggingface-datasets/ | ML/Data | Yes - process | Dataset preparation, validation, Hub integration |\n| huggingface-gradio | skills/huggingface-gradio/ | Frontend/ML | Yes - process | Interactive ML app development and deployment |\n| huggingface-community-evals | skills/huggingface-community-evals/ | ML/Data | Yes - process | Model evaluation and benchmarking workflows |\n| huggingface-vision-trainer | skills/huggingface-vision-trainer/ | ML/Data | Yes - process | Computer vision model training pipelines |\n| huggingface-paper-publisher | skills/huggingface-paper-publisher/ | ML/Academic | Yes - pattern | Research paper publishing and model documentation |\n| transformers-js | skills/transformers-js/ | Frontend/ML | Yes - process | Browser-based ML model deployment workflows |\n\n## Processes\n- **ml-model-training-pipeline**: Comprehensive workflow for training language models using TRL on Hugging Face infrastructure\n  - Source: skills/huggingface-llm-trainer/SKILL.md (lines 10-50)\n  - Placement: specializations/data-science-ml/\n  - Inputs: Training data, model configuration, hardware requirements\n  - Outputs: Trained model, training metrics, deployment artifacts\n  - Complexity: complex\n  - Notes: Covers SFT, DPO, GRPO, reward modeling, cost estimation, monitoring\n\n- **dataset-preparation-workflow**: Systematic process for preparing and validating datasets for ML training\n  - Source: skills/huggingface-datasets/SKILL.md\n  - Placement: specializations/data-science-ml/\n  - Inputs: Raw data, schema requirements, quality criteria\n  - Outputs: Validated dataset, metadata, quality report\n  - Complexity: moderate\n\n- **gradio-app-development**: End-to-end process for building interactive ML applications with Gradio\n  - Source: skills/huggingface-gradio/SKILL.md\n  - Placement: specializations/data-science-ml/\n  - Inputs: Model, UI requirements, deployment target\n  - Outputs: Interactive app, deployment configuration, user documentation\n  - Complexity: moderate\n\n- **model-evaluation-pipeline**: Systematic approach to evaluating and benchmarking ML models\n  - Source: skills/huggingface-community-evals/SKILL.md\n  - Placement: specializations/data-science-ml/\n  - Inputs: Model, evaluation datasets, benchmark criteria\n  - Outputs: Performance metrics, comparison reports, leaderboard submissions\n  - Complexity: moderate\n\n## Plugin Ideas\n- **gradio-app-builder**: Plugin for rapid ML application prototyping and deployment\n  - What install.md would do: Set up Gradio development environment, create app templates, configure deployment pipelines, install UI components\n  - Processes it would copy: gradio-app-development, ml-model-integration\n  - Configs/hooks it would create: Gradio templates, CSS themes, deployment scripts, monitoring configs\n  - Source evidence: huggingface-gradio skill with interactive app development workflows\n\n## Implicit Procedural Knowledge\n- **TRL Training Method Selection**: Process for choosing appropriate training method (SFT vs DPO vs GRPO) based on use case and data\n  - Source: huggingface-llm-trainer skill documentation and method comparison sections\n  - Placement: specializations/data-science-ml/\n  - Why codify: Provides systematic decision framework for training method selection in ML projects\n  - Sketch: Use case analysis -> Data type evaluation -> Method capability mapping -> Cost-performance trade-off -> Training method recommendation\n\n- **ML Infrastructure Cost Estimation**: Process for estimating and optimizing training costs on cloud ML infrastructure\n  - Source: Training skills' hardware selection and cost estimation guidance\n  - Placement: specializations/data-science-ml/\n  - Why codify: Systematic approach to ML infrastructure planning that's reusable across cloud providers\n  - Sketch: Model size analysis -> Training duration estimation -> Hardware requirements mapping -> Cost calculation -> Optimization recommendations\n\n## Library Mapping\n\n| Extractable Process | Library Status | Action | Existing Path | Target Placement |\n|-------------------|----------------|--------|---------------|------------------|\n| ML Model Training Pipeline | NEW | TRL training methods (SFT, DPO, GRPO) with infrastructure management | - | specializations/data-science-ml/ml-model-training-pipeline.js |\n| Dataset Preparation Workflow | NEW | Systematic dataset preparation and validation for ML training | - | specializations/data-science-ml/dataset-preparation-workflow.js |\n| Gradio App Development | NEW | End-to-end interactive ML application development process | - | specializations/data-science-ml/gradio-app-development.js |\n| Model Evaluation Pipeline | NEW | Systematic ML model evaluation and benchmarking methodology | - | specializations/data-science-ml/model-evaluation-pipeline.js |\n| TRL Training Method Selection | NEW | Decision framework for choosing SFT vs DPO vs GRPO training methods | - | specializations/data-science-ml/trl-training-method-selection.js |\n| ML Infrastructure Cost Estimation | NEW | Training cost estimation and optimization for cloud ML infrastructure | - | specializations/data-science-ml/ml-infrastructure-cost-estimation.js |\n| Computer Vision Training Pipeline | NEW | Specialized training pipeline for computer vision models | - | specializations/data-science-ml/computer-vision-training-pipeline.js |\n| Research Paper Publishing Process | NEW | ML research paper publishing and model documentation workflow | - | specializations/data-science-ml/research-paper-publishing.js |\n\n## Plugin Marketplace Mapping\n\n| Plugin Idea | Marketplace Status | Action | Existing Plugin | Target Placement |\n|-------------|-------------------|--------|-----------------|------------------|\n| Gradio App Builder | NEW | Rapid ML application prototyping with templates and deployment automation | - | plugins/a5c/marketplace/blueprints/gradio-app-builder/ |\n",
    "documents": []
  },
  "outgoingEdges": [],
  "incomingEdges": [
    {
      "from": "page:docs-reference-repos",
      "to": "page:docs-reference-repos-huggingface-skills-research",
      "kind": "contains_page"
    }
  ]
}

huggingface/skills json

Inspect the normalized record payload exactly as the atlas UI reads it.

File · wiki/docs/reference-repos/huggingface/skills/research.mdCluster · wiki

Record JSON

{
  "id": "page:docs-reference-repos-huggingface-skills-research",
  "_kind": "Page",
  "_file": "wiki/docs/reference-repos/huggingface/skills/research.md",
  "_cluster": "wiki",
  "attributes": {
    "nodeKind": "Page",
    "sourcePath": "docs/reference-repos/huggingface/skills/research.md",
    "sourceKind": "repo-docs",
    "title": "huggingface/skills",
    "displayName": "huggingface/skills",
    "slug": "docs/reference-repos/huggingface/skills/research",
    "articlePath": "wiki/docs/reference-repos/huggingface/skills/research.md",
    "article": "\n# huggingface/skills\n\n- **Archetype**: mega-skill-pack\n- **Stars**: 10,166\n- **Last pushed**: 2026-04-13\n- **License**: Apache-2.0\n- **Discovered**: 2026-04-13\n- **Source**: backlog-processing\n- **Skills found**: 11 (hf-cli, huggingface-community-evals, huggingface-datasets, huggingface-gradio, huggingface-llm-trainer, huggingface-paper-publisher, huggingface-papers, huggingface-tool-builder, huggingface-trackio, huggingface-vision-trainer, transformers-js)\n\n## Summary\nHugging Face's official skill collection for AI/ML development workflows, compatible with Claude Code, Codex, Gemini CLI, and Cursor. Contains 11 specialized skills covering the complete ML pipeline: datasets, training (LLM and vision), evaluation, Gradio apps, paper publishing, tool building, and deployment. Follows the standardized Agent Skills format with multi-harness plugin support and includes MCP server integration.\n\n## Assessment\nHIGH VALUE. This is an authoritative mega-skill-pack from Hugging Face containing production-grade ML workflows. The skills encode detailed procedural knowledge for complex tasks like model training (SFT, DPO, GRPO, reward modeling), dataset preparation and validation, evaluation pipeline setup, and Gradio app deployment. The training skills particularly contain sophisticated infrastructure management, cost estimation, and monitoring procedures that are directly extractable as specializations/data-science-ml/ processes.\n\n## Extraction Priority\nHIGH - Contains official Hugging Face workflows that are directly transferable:\n- ML model training pipelines -> specializations/data-science-ml/\n- Dataset preparation and validation workflows -> specializations/data-science-ml/\n- Model evaluation and benchmarking processes -> specializations/data-science-ml/\n- Gradio app development workflows -> specializations/frontend/ or specializations/data-science-ml/\n\n## Skills Inventory\n\n| Skill | Path | Domain | Transferable? | Notes |\n|-------|------|--------|---------------|-------|\n| huggingface-llm-trainer | skills/huggingface-llm-trainer/ | ML/Data | Yes - process | TRL training methods (SFT, DPO, GRPO), infrastructure management |\n| huggingface-datasets | skills/huggingface-datasets/ | ML/Data | Yes - process | Dataset preparation, validation, Hub integration |\n| huggingface-gradio | skills/huggingface-gradio/ | Frontend/ML | Yes - process | Interactive ML app development and deployment |\n| huggingface-community-evals | skills/huggingface-community-evals/ | ML/Data | Yes - process | Model evaluation and benchmarking workflows |\n| huggingface-vision-trainer | skills/huggingface-vision-trainer/ | ML/Data | Yes - process | Computer vision model training pipelines |\n| huggingface-paper-publisher | skills/huggingface-paper-publisher/ | ML/Academic | Yes - pattern | Research paper publishing and model documentation |\n| transformers-js | skills/transformers-js/ | Frontend/ML | Yes - process | Browser-based ML model deployment workflows |\n\n## Processes\n- **ml-model-training-pipeline**: Comprehensive workflow for training language models using TRL on Hugging Face infrastructure\n  - Source: skills/huggingface-llm-trainer/SKILL.md (lines 10-50)\n  - Placement: specializations/data-science-ml/\n  - Inputs: Training data, model configuration, hardware requirements\n  - Outputs: Trained model, training metrics, deployment artifacts\n  - Complexity: complex\n  - Notes: Covers SFT, DPO, GRPO, reward modeling, cost estimation, monitoring\n\n- **dataset-preparation-workflow**: Systematic process for preparing and validating datasets for ML training\n  - Source: skills/huggingface-datasets/SKILL.md\n  - Placement: specializations/data-science-ml/\n  - Inputs: Raw data, schema requirements, quality criteria\n  - Outputs: Validated dataset, metadata, quality report\n  - Complexity: moderate\n\n- **gradio-app-development**: End-to-end process for building interactive ML applications with Gradio\n  - Source: skills/huggingface-gradio/SKILL.md\n  - Placement: specializations/data-science-ml/\n  - Inputs: Model, UI requirements, deployment target\n  - Outputs: Interactive app, deployment configuration, user documentation\n  - Complexity: moderate\n\n- **model-evaluation-pipeline**: Systematic approach to evaluating and benchmarking ML models\n  - Source: skills/huggingface-community-evals/SKILL.md\n  - Placement: specializations/data-science-ml/\n  - Inputs: Model, evaluation datasets, benchmark criteria\n  - Outputs: Performance metrics, comparison reports, leaderboard submissions\n  - Complexity: moderate\n\n## Plugin Ideas\n- **gradio-app-builder**: Plugin for rapid ML application prototyping and deployment\n  - What install.md would do: Set up Gradio development environment, create app templates, configure deployment pipelines, install UI components\n  - Processes it would copy: gradio-app-development, ml-model-integration\n  - Configs/hooks it would create: Gradio templates, CSS themes, deployment scripts, monitoring configs\n  - Source evidence: huggingface-gradio skill with interactive app development workflows\n\n## Implicit Procedural Knowledge\n- **TRL Training Method Selection**: Process for choosing appropriate training method (SFT vs DPO vs GRPO) based on use case and data\n  - Source: huggingface-llm-trainer skill documentation and method comparison sections\n  - Placement: specializations/data-science-ml/\n  - Why codify: Provides systematic decision framework for training method selection in ML projects\n  - Sketch: Use case analysis -> Data type evaluation -> Method capability mapping -> Cost-performance trade-off -> Training method recommendation\n\n- **ML Infrastructure Cost Estimation**: Process for estimating and optimizing training costs on cloud ML infrastructure\n  - Source: Training skills' hardware selection and cost estimation guidance\n  - Placement: specializations/data-science-ml/\n  - Why codify: Systematic approach to ML infrastructure planning that's reusable across cloud providers\n  - Sketch: Model size analysis -> Training duration estimation -> Hardware requirements mapping -> Cost calculation -> Optimization recommendations\n\n## Library Mapping\n\n| Extractable Process | Library Status | Action | Existing Path | Target Placement |\n|-------------------|----------------|--------|---------------|------------------|\n| ML Model Training Pipeline | NEW | TRL training methods (SFT, DPO, GRPO) with infrastructure management | - | specializations/data-science-ml/ml-model-training-pipeline.js |\n| Dataset Preparation Workflow | NEW | Systematic dataset preparation and validation for ML training | - | specializations/data-science-ml/dataset-preparation-workflow.js |\n| Gradio App Development | NEW | End-to-end interactive ML application development process | - | specializations/data-science-ml/gradio-app-development.js |\n| Model Evaluation Pipeline | NEW | Systematic ML model evaluation and benchmarking methodology | - | specializations/data-science-ml/model-evaluation-pipeline.js |\n| TRL Training Method Selection | NEW | Decision framework for choosing SFT vs DPO vs GRPO training methods | - | specializations/data-science-ml/trl-training-method-selection.js |\n| ML Infrastructure Cost Estimation | NEW | Training cost estimation and optimization for cloud ML infrastructure | - | specializations/data-science-ml/ml-infrastructure-cost-estimation.js |\n| Computer Vision Training Pipeline | NEW | Specialized training pipeline for computer vision models | - | specializations/data-science-ml/computer-vision-training-pipeline.js |\n| Research Paper Publishing Process | NEW | ML research paper publishing and model documentation workflow | - | specializations/data-science-ml/research-paper-publishing.js |\n\n## Plugin Marketplace Mapping\n\n| Plugin Idea | Marketplace Status | Action | Existing Plugin | Target Placement |\n|-------------|-------------------|--------|-----------------|------------------|\n| Gradio App Builder | NEW | Rapid ML application prototyping with templates and deployment automation | - | plugins/a5c/marketplace/blueprints/gradio-app-builder/ |\n",
    "documents": []
  },
  "outgoingEdges": [],
  "incomingEdges": [
    {
      "from": "page:docs-reference-repos",
      "to": "page:docs-reference-repos-huggingface-skills-research",
      "kind": "contains_page"
    }
  ]
}