What is Syda?¶
Syda seamlessly generate realistic synthetic test data - structured, unstructured, PDF, and HTML data generation with AI and large language models while preserving referential integrity, maintaining privacy compliance, and accelerating development workflows using OpenAI, Anthropic, and Gemini models
Key Features¶
Multi-Provider AI Integration:
- Seamless integration with multiple AI providers
- Support for OpenAI (GPT), Anthropic (Claude) and Google(Gemini).
- Default model is Anthropic Claude model claude-3-5-haiku-20241022
- Consistent interface across different providers
- Provider-specific parameter optimization
LLM-based Data Generation:
- AI-powered schema understanding and data creation
- Contextually-aware synthetic records
- Natural language prompt customization
- Intelligent schema inference
Multiple Schema Formats:
- YAML/JSON schema file support with full foreign key relationship handling
- SQLAlchemy model integration with automatic metadata extraction
- Python dictionary-based schema definitions
Referential Integrity
- Automatic foreign key detection and resolution
- Multi-model dependency analysis through topological sorting
- Robust handling of related data with referential constraints
Custom Generators
- Register column- or type-specific functions for domain-specific data
- Contextual generators that adapt to other fields (like ICD-10 codes based on demographics)