× Soumya Savarn

About Me

Hi, I'm Soumya Savarn, a senior at IIT Guwahati, pursuing a B.Tech in Data Science and Artificial Intelligence. I have a keen interest in competitive programming, deep learning, especially generative models and web development. I also explore topics in Computer Science through my B.Tech minor course.

Welcome to My Portfolio

Explore my work, read my latest blogs, and get in touch!

Blogs

Everything about Autoencoders: From Linear Algebra to its multimodal and cross-modal generative capabilities using VAEs
May 2025 • Multimodal Unsupervised Learning
Autoencoders are a type of artificial neural network used for unsupervised learning. Their primary goal is to learn efficient representations of input data, typically for dimensionality reduction or noise removal. However, there is a lot more inherent in its architecture.

[20 minute read] • Artificial Intelligence (Intermediate) • Pre-requisites: Multi Layer Perceptrons and vanilla Autoencoders After this tutorial, you would be able to build your own multimodal generative model from scratch.

Projects

Synthetic Dataset Generation using CGANs
April 2025 • Python, Pytorch, Streamlit
Developed two Conditional Generative Adversarial Networks, first one is CTGAN for tabular data and second one is Pix2Pix for image data to comic face. The project includes a web application for generating synthetic datasets using these models. The CTGAN model trains on the fly and generates synthetic data, while the Pix2Pix model is trained on NVIDIA P100 GPU for 9.5 hours on the Comic Face Dataset on Kaggle. The web app allows users to upload images and download generated datasets.
Campus Collab
Kriti - Inter Hostel Tech Event, IIT Guwahati • Jan. 2024 - Feb. 2024 • MongoDB, Express.js, React.js, Node.js, Figma
A web platform for campus students to find collaborators for their projects or a project to work on. Implemented major dynamic content in the frontend using React.js, reducing codebase and response time. Implemented a spam filtering algorithm using a heuristic criterion and reduced time taken by 67 ms per message.
Army Database
May 2024 • Database Design, Streamlit, SQL
A database design project for an Indian Army Camp, North Guwahati that supports daily local transactions in a single place while maintaining ACID properties. This system helps manage personnel, equipment, and operations data efficiently.
FitAI
April 2024 • Python, Flask, GenAI
A fitness tracker and predictor application that uses both generative AI and classical ML algorithms for fitness-related predictions. Features include activity tracking, goal setting, and predictive analysis. Deployed and accessible online as of February 2025.
Real Estate Search System
December 2023 • C++
A real estate property search and management system implemented in C++. This application allows users to search, filter, and manage real estate listings with an efficient search algorithm.
Terminal-based Text Editor
December 2023 • C++, Data Structures
A basic terminal-based text editor similar to Vim, implemented from scratch in C++ using pointers and linked list data structures. This project demonstrates low-level text manipulation and editor functionality.

Work Experience

Microsoft
SWE Internship • May 2025 - July 2025
  • Team: M365 Sovereign Cloud Buildout
  • Responsible for automation tasks in Stage 0/1 buildout.
  • Led database migration of Azure CosmosDB inventory data from Azure Global (US West3) to Azure Bleu (sovereign cloud of France).
  • Debugged the pipeline, improving performance by 3.27×, and developed an asynchronous validation script using Azure Functions that verifies 300,000+ records in under 1–2 minutes.
  • Explored two approaches for database migration from CosmosDB emulator (VM) to CosmosDB account within the same tenant:
    • Azure CosmosDB Desktop Data Migration Tool: ~7 records/sec
    • Custom C# Script: 100+ records/sec — superior in speed and security
  • Tech Stack: Azure Data Factory, Azure Cosmos DB, C#

Contact

Feel free to reach out to me at soumyasavarn2@gmail.com