START Project Documentation Hub
Welcome to the comprehensive documentation for the START (Scalable, Tailored Active-inference Research & Training) project - an advanced AI-powered system for creating personalized Active Inference and Free Energy Principle curricula.
🚀 What is START?
START is a complete educational content generation pipeline that combines:
- Real-time research using Perplexity API for current domain insights
- Advanced LLM-based content generation via OpenRouter for professional-quality curricula
- Comprehensive personalization tailored to specific learners and professional domains
- Multilingual capabilities with full cultural adaptation
- Rich visualizations including charts, diagrams, and interactive elements
📚 Documentation Structure
Core Guides
🛠️ Setup & Development
- Environment Setup - Complete installation, configuration, and development guide
  - Prerequisites, API setup, dependency management
  - Development workflow, testing, troubleshooting
  - IDE integration and advanced configuration
🔄 System Architecture
- Pipeline Overview - Comprehensive system architecture and workflow
  - 4-stage curriculum creation pipeline
  - Configuration-driven research approach
  - API integration and content generation standards
🔗 External Integrations
- Repository & Clone Management - External resource integration
  - Active Inference Institute ecosystem integration
  - Knowledge graph and implementation repositories
  - Educational resource enhancement strategies
Specialized Documentation
📖 User Guides
🔧 Configuration Reference
- Entity Configuration: data/config/entities.yaml - Target learner profiles
- Domain Configuration: data/config/domains.yaml - Professional domain definitions
- Language Configuration: data/config/languages.yaml - Translation targets
📦 Data & Outputs
📊 Visualizations
🌍 Translations
🧭 Docs & Deployment
📝 Prompt Engineering
- Domain Analysis Templates: data/prompts/research_domain_analysis.md
- Curriculum Generation Templates: data/prompts/research_domain_curriculum.md
- Personalization Templates: data/prompts/research_entity.md
- Translation Framework: data/prompts/translation.md
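Because the pipeline depends on these four template files, a quick existence check can catch a broken checkout before any API calls are made. The following is a minimal sketch, not part of the START codebase: the function name `missing_templates` is hypothetical, and the only project details it relies on are the paths listed above.

```python
from pathlib import Path

# Template paths as listed in this documentation, relative to the repository root.
PROMPT_TEMPLATES = [
    "data/prompts/research_domain_analysis.md",
    "data/prompts/research_domain_curriculum.md",
    "data/prompts/research_entity.md",
    "data/prompts/translation.md",
]


def missing_templates(repo_root: str) -> list[str]:
    """Return the listed template paths that do not exist under repo_root."""
    root = Path(repo_root)
    return [rel for rel in PROMPT_TEMPLATES if not (root / rel).is_file()]


if __name__ == "__main__":
    # Run from the repository root; prints nothing when all templates are present.
    for rel in missing_templates("."):
        print(f"missing prompt template: {rel}")
```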
🎯 Quick Start Paths
For New Users
- Environment Setup - Get up and running
- Pipeline Overview - Understand the system
- Usage Guide (GitHub)
For Developers
- Environment Setup - Development environment
- API Docs (GitHub)
- Tests (GitHub)
For Researchers
- Pipeline Overview - Research capabilities
- Clone Management - Access research repositories
- Configuration Files - Customize research targets
🌐 External Resources
Active Inference Institute Ecosystem
Math & Programming Resources
In-Repo Entry Points
📋 System Capabilities
Research & Analysis
- Professional Domains: Life sciences, technology, business, healthcare, education, or any other domain you configure
- 8 Target Entities: Political figures, scientists, tech leaders, educators, or any other entity or audience you configure
- Real-time Research: Current industry insights and professional analysis
- Comprehensive Analysis: 3,000-5,000 word domain reports
Content Generation
- Professional-Grade Curricula: 40-60 hour structured learning programs; more granular and custom outputs are planned for this repo
- Personalized Learning: 5,000-8,000 word tailored strategies
- Modular Design: 3-5 hour comprehensive learning units
- Assessment Integration: Built-in evaluation and progress tracking
- Data Visualizations: PNG charts with curriculum metrics and analysis
- Process Diagrams: Mermaid diagrams for structure and flow
- Interactive Elements: Visual learning aids and conceptual frameworks
Multilingual Support
- 9+ Languages: Chinese, Spanish, Arabic, Hindi, French, Japanese, Russian, Swahili, Tagalog
- Cultural Adaptation: Full localization beyond literal translation
- Professional Quality: Native-speaker level fluency with technical accuracy
🔧 Configuration Overview
Research Configuration
# data/config/entities.yaml
entities:
  - name: "karl_friston"
    category: "scientist"
    priority: "high"

# data/config/domains.yaml
domains:
  - name: "biochemistry"
    category: "life_sciences"
    priority: "high"
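To illustrate how a priority value might be used to select research targets, here is a minimal Python sketch that filters parsed configuration entries. It is an assumption about usage, not the project's actual loader: the function `names_with_priority` and the second entity entry are hypothetical, and the data is inlined as a dict rather than read from the YAML file so that the example has no dependencies.

```python
from typing import Any

# Parsed form of the entities.yaml example. In the real project this would
# presumably come from a YAML loader; it is inlined here to stay dependency-free.
ENTITIES_CONFIG: dict[str, Any] = {
    "entities": [
        {"name": "karl_friston", "category": "scientist", "priority": "high"},
        # Hypothetical second entry, added only to show the filter at work.
        {"name": "example_educator", "category": "educator", "priority": "low"},
    ]
}


def names_with_priority(config: dict[str, Any], section: str, priority: str) -> list[str]:
    """Return the names of entries in config[section] with the given priority."""
    return [
        entry["name"]
        for entry in config.get(section, [])
        if entry.get("priority") == priority
    ]


print(names_with_priority(ENTITIES_CONFIG, "entities", "high"))  # ['karl_friston']
```

The same function would work for the domains file by passing `"domains"` as the section name.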
Command-Line Interface
# From repository root
export PYTHONPATH="$(pwd)${PYTHONPATH:+:${PYTHONPATH}}"
# Research high-priority entities
uv run python learning/curriculum_creation/1_Research_Entity.py --priority high
# Generate domain-specific curricula
uv run python learning/curriculum_creation/1_Research_Domain.py --domain biochemistry
# Create multilingual content
uv run python learning/curriculum_creation/4_Translate_Introductions.py --languages Spanish French
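When these steps need to be scripted rather than typed, the same commands can be assembled in Python. The sketch below only builds and prints the command lines; it uses the script names and flags shown above and nothing else. The helper `build_command` is hypothetical and not part of the repository.

```python
import shlex

# Directory holding the pipeline scripts, as used in the commands above.
SCRIPT_DIR = "learning/curriculum_creation"


def build_command(script: str, *args: str) -> list[str]:
    """Build the argv list for running one pipeline script through uv."""
    return ["uv", "run", "python", f"{SCRIPT_DIR}/{script}", *args]


commands = [
    build_command("1_Research_Entity.py", "--priority", "high"),
    build_command("1_Research_Domain.py", "--domain", "biochemistry"),
    build_command("4_Translate_Introductions.py", "--languages", "Spanish", "French"),
]

for argv in commands:
    # Printed as a dry run; pass argv to subprocess.run(argv, check=True)
    # to execute a step from the repository root.
    print(shlex.join(argv))
```

Keeping each command as a list of arguments avoids shell quoting problems if a language or domain name ever contains spaces.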
📊 Project Structure
START/
├── src/ # Core system implementation
├── learning/ # Curriculum creation scripts
├── data/ # Generated content and configuration
├── docs/ # Comprehensive documentation
├── tests/ # Test suite and validation
└── README.md # Project overview and quick start
🔄 Development Workflow
Standard Development Cycle
- Configure targets in data/config/ YAML files
- Run research using domain and entity scripts
- Generate curricula with comprehensive content creation
- Create visualizations for enhanced learning
- Translate content for multilingual accessibility
Quality Assurance
- Comprehensive testing with pytest and TDD approach
- Code quality with ruff linting and black formatting
- API integration testing for Perplexity and OpenRouter
- Content validation against Active Inference standards
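As an illustration of the pytest-based approach, a configuration check might look like the sketch below. The required keys are an assumption inferred from the example entries in this document (name, category, priority), not a documented schema, and `validate_entries` is a hypothetical helper rather than a function from the START test suite.

```python
# Keys assumed to be required, inferred from the example config entries.
REQUIRED_KEYS = {"name", "category", "priority"}


def validate_entries(entries: list[dict]) -> list[str]:
    """Return a human-readable problem for each entry missing a required key."""
    problems = []
    for index, entry in enumerate(entries):
        missing = sorted(REQUIRED_KEYS - set(entry))
        if missing:
            problems.append(f"entry {index}: missing {', '.join(missing)}")
    return problems


def test_complete_entry_passes():
    entry = {"name": "biochemistry", "category": "life_sciences", "priority": "high"}
    assert validate_entries([entry]) == []


def test_incomplete_entry_is_reported():
    assert validate_entries([{"name": "biochemistry"}]) == [
        "entry 0: missing category, priority"
    ]
```

Saved in a file whose name starts with `test_`, the two test functions would be collected by pytest's default discovery rules.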
📞 Getting Help
Documentation Resources
- This documentation hub for comprehensive guides
- Inline code documentation with detailed docstrings
- Example usage in test files and usage guides
- Configuration examples in YAML files
- Active Inference Institute for research questions
- GitHub Issues for technical problems and feature requests
- Test Suite for usage examples and validation patterns
START combines AI-assisted research with structured pedagogical design to produce Active Inference curricula tailored to a professional domain or an individual learner.