Rachana Marneni

Charlotte

Summary

Results-driven Data Engineer with 5+ years of experience building and optimizing ETL pipelines, cloud data platforms, and analytics solutions across healthcare, banking, and enterprise domains. Skilled in Azure, AWS, Snowflake, Databricks, Python, PySpark, SQL, and Power BI. Strong focus on data quality, governance, compliance, and real-time data processing to deliver actionable business insights.

Overview

7 years of professional experience
1 Certification

Work History

Data Engineer

Blue Cross Blue Shield | Claims Data Platform
02.2024 - Current
  • Architected and optimized Azure Synapse Analytics and Databricks pipelines for claims and provider data, reducing query execution times by 40%.
  • Designed and implemented Snowflake pipelines integrated with AWS S3 and Glue (Streams, Snowpipe), reducing ETL processing from 3 hours to under 1 hour for member and policy datasets.
  • Built real-time streaming pipelines using Kafka and AWS Kinesis, lowering claims data latency from 2 hours to
  • Automated data quality checks with Informatica, Talend, PySpark, and SQL, improving accuracy and ensuring HIPAA-compliant data management.
  • Implemented RBAC, encryption, and data masking in Azure and Snowflake, securing PHI and supporting HIPAA compliance.
  • Delivered Power BI dashboards with DAX and real-time data connections, enabling healthcare operations and executive teams to make faster, data-driven decisions.

Data Engineer

PNC Bank | Enterprise Data Warehouse Modernization
06.2021 - 08.2022
  • Migrated legacy ETL workflows to Azure Data Factory, automating 80% of manual finance and risk processes.
  • Designed and implemented a Snowflake data warehouse on Azure, improving reporting speed and supporting faster regulatory and SOX-compliant reporting.
  • Developed and optimized ETL pipelines in Azure Data Factory & Databricks (PySpark, SparkSQL, Scala Spark), reducing nightly batch runtime from 6 hours to 1.5 hours for transaction and account data.
  • Built real-time data ingestion pipelines using Azure Event Hub, Databricks, and Kafka, ensuring timely availability of banking and transactional data for analytics.
  • Implemented RBAC, encryption, and secure data-sharing policies in Snowflake, maintaining SOX and internal compliance standards.
  • Delivered Power BI dashboards with optimized DAX queries and real-time data connections, enabling finance and risk teams to track KPIs and make data-driven decisions.

Data Analyst

NTT Data | Enterprise Data Consolidation
01.2019 - 06.2021
  • Built and maintained ETL pipelines on AWS Glue and Redshift, consolidating enterprise data from multiple sources into a centralized warehouse.
  • Developed data ingestion workflows using Apache NiFi and Spark on AWS EMR, reducing pipeline execution time by 25% and improving data availability.
  • Automated reporting and analytics using Tableau and Power BI, enabling executives to track KPIs with daily refreshes instead of weekly.
  • Designed predictive and statistical models using PySpark, Scikit-learn, and SAS, improving forecasting accuracy for business stakeholders.
  • Conducted data quality checks and cleansing using Informatica and Matillion, ensuring high integrity across cloud and on-premise systems.
  • Implemented data governance practices, including lineage tracking and secure access policies, supporting compliance and audit requirements.

Education

Master of Science - Computer and Information Sciences

Western Illinois University
IL, USA
12.2023

Certification

AWS Certified Data Engineer Associate

Technical Skillset

AWS (S3, Glue, Redshift, Athena, EMR, Lambda, Kinesis, SageMaker), Azure (Data Factory, Data Lake, Synapse, Databricks, Blob Storage, Azure SQL), GCP (BigQuery, Google Analytics), Apache Spark, PySpark, Spark SQL, Hadoop, Hive, Pig, Delta Lake, Kafka, Python, SQL (Advanced), Scala (Spark), R, SAS, Informatica PowerCenter & Data Quality, Talend, Matillion, Apache NiFi, Apache Airflow, Control-M, Snowflake, MS SQL Server, MySQL, PostgreSQL, MariaDB, Oracle Exadata, SSAS, Star Schema, Snowflake Schema, Data Lakes, Power BI (DAX, Power Query), Tableau, QlikView, SSRS, Scikit-learn, PySpark MLlib, AWS SageMaker, Statistical Analysis, Hypothesis Testing, Git, GitHub, GitLab, Azure DevOps, Terraform, GitLab CI/CD, GitHub Actions, RBAC, Encryption Policies, Data Masking, JIRA, Confluence

References

References available upon request.
