6.4 KiB
AI Multimodal Integration for Frontend Design
Entry point for using the ck:ai-multimodal skill to generate and analyze visual assets that align with frontend design thinking and aesthetic guidelines.
When to Use
Use ck:ai-multimodal in frontend design when you need to:
Asset Generation:
- Generate hero images, background assets, decorative elements
- Create placeholder images with specific aesthetic qualities
- Produce marketing visuals that match UI design language
- Generate icon sets, illustrations, or graphic elements
- Create texture overlays, gradient meshes, or background patterns
- Prototype visual concepts before implementing in code
Visual Analysis:
- Analyze generated assets to verify they meet design standards
- Compare multiple variations objectively with ratings
- Extract exact color palettes with hex codes for implementation
- Test assets with UI overlays for readability and contrast
Design Extraction:
- Extract design guidelines from existing images or videos
- Analyze competitor designs to understand their approach
- Reverse-engineer design systems from inspiration screenshots
- Create documented guidelines based on visual analysis
- Establish consistent aesthetic direction from references
Core Principles
1. Design-Driven Generation
NEVER generate generic AI imagery. Every asset must align with:
- The chosen aesthetic direction (brutalism, maximalism, retro-futurism, etc.)
- Typography system and visual hierarchy
- Color palette and theme consistency
- Overall design story and purpose
2. Contextual Asset Creation
Assets aren't standalone—they're part of a cohesive interface. Consider:
- Purpose: Hero image vs. background texture vs. decorative element
- Integration: How it interacts with overlaid text, buttons, forms
- Technical constraints: File size, aspect ratio, responsive behavior
- Accessibility: Color contrast, text readability, decorative vs. informative
3. Analysis is Mandatory
Never integrate assets without comprehensive analysis:
- Score quality objectively (1-10 scale, minimum 7/10)
- Extract specific values: hex codes, not "blue"; px sizes, not "large"
- Compare multiple variations before deciding
- Test with UI overlays, not in isolation
4. Learn from Excellence
Extract design systems systematically from high-quality references:
- Analyze 3-5 screens to identify patterns
- Document actionably with CSS variables and exact values
- Validate predictions (fonts, colors) manually
- Adapt principles contextually, don't copy blindly
Workflow Quick Reference
For Asset Generation
See: asset-generation.md
- Define design context (aesthetic, colors, typography, tone)
- Craft design-driven prompts (not generic)
- Generate with appropriate Imagen 4 model
- Analyze and verify quality (score ≥ 7/10)
- Iterate or integrate based on results
Models: imagen-4.0-generate-001 (standard), imagen-4.0-ultra-generate-001 (production), imagen-4.0-fast-generate-001 (iteration)
For Visual Analysis
See: visual-analysis.md
- Define evaluation criteria (context-specific)
- Run comprehensive analysis with structured prompts
- Compare multiple variations objectively
- Extract color palettes with hex codes
- Test integration with UI elements
Model: gemini-2.5-flash (vision understanding)
For Design Extraction
See: design-extraction.md
- Capture high-quality reference screenshots
- Extract comprehensive design elements systematically
- Analyze multiple screens for consistent patterns
- Extract motion guidelines from videos (if applicable)
- Document actionably with CSS-ready specifications
Model: gemini-2.5-flash (vision understanding)
Integration with Other Skills
With aesthetic Skill
Use aesthetic for overall design system guidance and quality evaluation framework. Then use ck:frontend-design with ck:ai-multimodal for asset generation and analysis that follows those guidelines.
With ck:chrome-devtools Skill
Use ck:chrome-devtools to capture full-screen screenshots from inspiration websites for design extraction. Capture at actual viewport size, not full-page scrolls.
With ui-styling Skill
Generate and analyze assets first, then implement using shadcn/ui + Tailwind with colors/styles that complement the generated imagery.
With web-frameworks Skill
Optimize generated assets for Next.js App Router: image optimization, responsive images, lazy loading.
With ck:media-processing Skill
Post-process generated assets: resize, compress, add filters, create compositions using FFmpeg/ImageMagick.
Navigation
Detailed Workflows:
asset-generation.md- Complete generation workflow with prompt strategiesvisual-analysis.md- Analysis and verification workflowdesign-extraction.md- Extract guidelines from existing designs
Additional Resources:
technical-guide.md- File optimization, examples, checklists, common pitfallsanimejs.md- Animation implementation for frontend
Quick Commands
Generate asset:
python scripts/gemini_batch_process.py \
--task generate \
--prompt "[design-driven prompt]" \
--output docs/assets/[name] \
--model imagen-4.0-generate-001 \
--aspect-ratio 16:9
Analyze asset:
python scripts/gemini_batch_process.py \
--files docs/assets/[image].png \
--task analyze \
--prompt "[evaluation criteria]" \
--output docs/assets/analysis.md \
--model gemini-2.5-flash
Extract design guidelines:
python scripts/gemini_batch_process.py \
--files docs/inspiration/[reference].png \
--task analyze \
--prompt "[extraction criteria from design-extraction.md]" \
--output docs/design-guidelines/extracted.md \
--model gemini-2.5-flash
Remember
- Design First, Generate Second: Start with design thinking, not generation capabilities
- Context is King: Every asset serves the interface, not itself
- Iterate Ruthlessly: First generation is rarely final—evaluate and refine
- Analysis is Mandatory: Never integrate without comprehensive verification (≥7/10)
- Demand Specifics: Hex codes not "blue", px not "large", ms not "fast"
- Learn from Excellence: Extract design systems from high-quality references systematically
- Adapt, Don't Copy: Understand principles, apply contextually to your unique design
Generate assets that elevate frontend design, maintain aesthetic consistency, and serve user experience—never generic, always contextual.