Results-driven Data Engineer with 5+ years of experience building and optimizing ETL pipelines, cloud data platforms, and analytics solutions across healthcare, banking, and enterprise domains. Skilled in Azure, AWS, Snowflake, Databricks, Python, PySpark, SQL, and Power BI. Strong focus on data quality, governance, compliance, and real-time data processing to deliver actionable business insights.
AWS Certified Data Engineer Associate
AWS (S3, Glue, Redshift, Athena, EMR, Lambda, Kinesis, SageMaker), Azure (Data Factory, Data Lake, Synapse, Databricks, Blob Storage, Azure SQL), GCP (BigQuery, Google Analytics), Apache Spark, PySpark, Spark SQL, Hadoop, Hive, Pig, Delta Lake, Kafka, Python, SQL (Advanced), Scala (Spark), R, SAS, Informatica PowerCenter & Data Quality, Talend, Matillion, Apache NiFi, Apache Airflow, Control-M, Snowflake, MS SQL Server, MySQL, PostgreSQL, MariaDB, Oracle Exadata, SSAS, Star Schema, Snowflake Schema, Data Lakes, Power BI (DAX, Power Query), Tableau, QlikView, SSRS, Scikit-learn, PySpark MLlib, AWS SageMaker, Statistical Analysis, Hypothesis Testing, Git, GitHub, GitLab, Azure DevOps, Terraform, GitLab CI/CD, GitHub Actions, RBAC, Encryption Policies, Data Masking, JIRA, Confluence