stack-profile:prompt-engineering-workbench
Prompt Engineering Workbench (TypeScript, React, PostgreSQL, LLM APIs, Redis) overview
A developer-facing prompt engineering workbench that provides a React-based playground for authoring, versioning, and A/B testing prompts against multiple LLM providers. PostgreSQL stores prompt versions, evaluation datasets, and scored results. Redis caches LLM responses for rapid iteration during prompt development. The TypeScript backend (via Hono) proxies LLM API calls with token tracking, cost attribution, and latency measurement. Zod validates prompt templates and evaluation schemas. Designed for AI product teams and prompt engineers who need structured experimentation beyond ad-hoc notebook workflows. The tradeoff is evaluation subjectivity — automated scoring captures surface-level quality but often requires human rating for nuanced prompt quality assessment.
Attributes
Outgoing edges
- domain:ml-ai·DomainML/AI
- domain:software-engineering·DomainSoftware Engineering
- language:typescript·LanguageTypeScript
- framework:react·FrameworkReact
- library:zod·LibraryZod
- library:ioredis·Libraryioredis
- framework:hono·FrameworkHono
- library:prisma·LibraryPrisma
- library:zustand·LibraryZustand
- library:tailwindcss·LibraryTailwind CSS
- workflow:prompt-engineering-iteration·WorkflowPrompt Engineering Iteration
- workflow:agent-evaluation-cycle·WorkflowAgent Evaluation Cycle
- skill-area:prompt-engineering·SkillAreaPrompt Engineering
- skill-area:ai-evaluation·SkillAreaAI Evaluation
- skill-area:frontend-development·SkillAreaFrontend Development
- skill-area:backend-api-design·SkillAreaBackend API Design
- skill-area:data-analytics·SkillAreaData Analytics
- role:ml-engineer·RoleMachine Learning Engineer
- role:frontend-engineer·RoleFrontend Engineer
- role:research-engineer·RoleResearch Engineer