Large scale simulations made simple.
-
Updated
Aug 14, 2025 - HTML
Large scale simulations made simple.
An open-source software for synthetic web-based user interface and content dataset generation.
UNECE HLG-MOS Synthetic Data Challenge (TEAM DESTATIS)
AAAI2025 Paper (Oral) "SS-GEN: A Social Story Generation Framework with Large Language Models" (SS-GEN)
Repository for my Master Thesis on Gossiping Protocols and Information Propagation. Includes mathematical models, simulations, and applications to study decentralized systems and optimize information dissemination.
LLM-Powered Dataset Creation Tool
Testing out statistical models for generating synthetic tabular data.
Bulk Synthetic Data Generation
🔬 SciPyMasterPro — A hands-on, modular project to master SciPy for statistics, optimization, linear algebra, curve fitting, and simulations. Includes 10+ Jupyter notebooks, an interactive Streamlit app, synthetic datasets, reusable utility functions, Dockerized setup, and cheatsheets for fast recall, portfolio building, and interview prep.
Fraud Detection of a 6 million row dataset using AWS and Spark
🐙 KnowledgeBase: Interactive PostgreSQL troubleshooting hub with searchable errors, copy-ready SQL examples, AI chat, analytics and PDF export to resolve issues.
Synthetic Data in HealthCare
Add a description, image, and links to the synthetic-dataset-generation topic page so that developers can more easily learn about it.
To associate your repository with the synthetic-dataset-generation topic, visit your repo's landing page and select "manage topics."