
Hello, I'm
Sachin Loddiya Karthik
Building Scalable Data Solutions

Get To Know More
About Me
I'm a Data Engineer who thrives on transforming complex data into actionable insights. With over 4 years of experience, I specialize in building and automating robust ETL pipelines and architecting scalable data lakehouses in the cloud.
My passion lies at the intersection of data and cloud technology, where I enjoy leveraging tools like PySpark, Azure Databricks, and Synapse to design solutions that not only meet technical requirements but also drive real business outcomes. I'm driven by a constant curiosity to learn and a desire to build data systems that are not just efficient, but elegant.
Explore My
Experience
Data Scientist
Jan 2025 - Apr 2025
Western Michigan University, USA
- Designed an automated class scheduling solution using Google OR-Tools and constraint programming, reducing manual scheduling time by 85% and optimizing faculty-course assignments for 200+ courses across 15 departments.
- Created and deployed a Streamlit-based UI processing 50+ CSV files with real-time validation, reducing data input errors by 90% and enabling interactive visualizations of optimized schedules for 100+ faculty members.
- Reduced scheduling conflicts by 95% and eliminated 40+ hours of weekly manual effort by integrating advanced optimization algorithms with a user-friendly frontend interface.
Data Scientist
Jan 2024 - Apr 2024
Green Expectations LLC, USA
- Built the backend infrastructure for an AI-powered home sustainability platform, processing 1K+ data points per user via a rule-based House Sustainability Calculator, boosting recommendation accuracy by 35%.
- Led the development of a secure user authentication system with persistent storage, giving each user individualized access and supporting real-time tracking for 500+ active users.
- Engineered data flows for an NLP-driven conversational AI chatbot, reducing response latency by 40% and increasing user engagement by 25%.
Data Engineer
Jul 2021 - Jul 2023
Accenture, India
- Engineered ETL pipelines with Azure Data Factory to migrate and reformat critical supply chain data into Parquet format, boosting processing efficiency by 40% while minimizing disruption.
- Utilized Azure Databricks to process raw supply chain data from Data Lake Gen2, enhancing accuracy by 25% and halving transformation times.
- Developed external tables and views within Azure SQL and implemented interactive Power BI dashboards, enhancing data access and delivering insights that resulted in a 25% increase in operational efficiency.
- Improved data workflows by integrating ADF triggers, Databricks notebooks, and CI/CD pipelines via Azure DevOps, boosting team efficiency by 15% and accelerating SCPO upgrade project delivery by 30%.
Data Engineer
Nov 2020 - May 2021
Claritrics India Pvt Limited, India
- Pioneered an advanced ETL pipeline using Azure Data Factory and Azure Databricks to create an OCR solution for extracting text from images, improving extraction accuracy by 30%.
- Crafted and scheduled an end-to-end automated workflow using Azure Data Factory for orchestration and Azure Databricks for scalable image processing, reducing text extraction processing time by 40%.
- Enhanced development and deployment cycles with Azure Data Factory triggers and Databricks job scheduling, resulting in a 20% gain in team efficiency and a 30% acceleration of project timelines.
Explore My
Technical Skills
Data Engineering
Data Science
Data Analysis & BI
Browse My Recent
Projects
My Academic Journey
Education

Master's in Data Science
Aug 2023 - Apr 2025
Western Michigan University, USA

B.E. in Electronics and Communication Engineering
Aug 2016 - May 2020
Anna University, India
My Achievements
Certifications
Google Cloud Big Data and Machine Learning Fundamentals
Modernizing Data Lakes and Data Warehouses with Google Cloud
Foundations: Data, Data, Everywhere
Gen AI Foundational Models for NLP & Language Understanding
Generative AI and LLMs: Architecture and Data Preparation
ETL and Data Pipelines with Shell, Airflow and Kafka
Introduction to Relational Databases (RDBMS)
Databases and SQL for Data Science with Python (with Honors)
Data Analytics Essentials
Introduction to Data Science
Apache Kafka
Azure Data Factory for Data Engineers
Azure Databricks & Spark For Data Engineers (PySpark / SQL)
Azure Synapse Analytics For Data Engineers
SQL for Data Science
Get in Touch