Summary
Overview
Work History
Education
Skills
Certification
Projects
Timeline
Generic

Kristen Cranford

Louisburg,NC

Summary

Data professional with 9+ years of experience in data analytics, transitioning into AI Quality Control (QC) and Data Science. Strong expertise in Python, SQL, and machine learning with a background in data validation, anomaly detection, and automation. Enthusiastic about ensuring AI model integrity by identifying bugs, troubleshooting AI systems, and improving data pipelines. Currently pursuing an M.S. in Data Science to deepen expertise in AI/ML model evaluation and quality assurance.

Overview

10
10
years of professional experience
1
1
Certification

Work History

Data Analyst | AI Quality Control & Data Validation

Metabolon
06.2024 - Current
  • Ensuring AI Model Integrity by validating training datasets and identifying inconsistencies before deployment
  • Developing Python scripts for data preprocessing, anomaly detection, and automated data validation
  • Working with AI engineers to troubleshoot model issues, ensuring models generalize well to real-world data
  • Conducting exploratory data analysis (EDA) to detect biases, missing values, and inconsistencies in datasets

Data Analyst | Biological Development & AI QC

Metabolon
05.2023 - 06.2024
  • Implemented machine learning techniques to clean and preprocess biological datasets for biomarker discovery
  • Developed Python-based automation tools for handling missing data, duplicates, and formatting inconsistencies
  • Validated datasets for AI model training, ensuring data integrity and reducing biases
  • Built REST APIs and R-based data pipelines to integrate biological data with AI models in a Linux-based environment
  • Conducted SQL-based data extraction for high-volume biological datasets, ensuring accurate reporting
  • Created various Excel documents to assist with pulling metrics data and presenting information to stakeholders for concise explanations of best placement for needed resources.

Data Analyst | Data Curation & AI Model Validation

Metabolon
01.2022 - 05.2023
  • Supervised data processing pipelines to optimize AI training datasets
  • Collaborated with cross-functional teams to enhance machine learning models used for health assessments
  • Developed quality control strategies to validate outputs from AI-driven analyses
  • Led AI-driven pattern recognition projects to uncover trends in large-scale scientific datasets
  • Produced monthly reports using advanced Excel spreadsheet functions.

Supervisor - R&D Coordinator, Data Curation

Metabolon
01.2021 - 12.2021
  • Managed a team of nine data analysts, overseeing workflows, process improvements, and project assignments
  • Led internal process improvements for data curation, ensuring higher accuracy and efficiency in data processing
  • Developed analytical method enhancements and optimized data pipelines for scientific research
  • Provided performance assessments and mentorship, guiding analysts in advanced data processing techniques

Research Scientist | Data Curation & AI Training Data QC

Metabolon
01.2017 - 12.2020
  • Worked with machine learning engineers to optimize data pipelines for AI model training
  • Built automation scripts for processing and labeling large-scale biological data
  • Led quality control initiatives, ensuring accuracy in experimental results used for AI-based analytics

Assistant Scientist | Data Curation

Metabolon
08.2015 - 12.2016
  • Performed routine LC/MS instrument maintenance and data analysis
  • Collaborated with mass spectrometrists on data analysis to improve method performance
  • Maintained accurate records of analytical procedures and results for regulatory compliance
  • Generated high-quality deliverables that met customer requirements in a timely manner

Education

Master of Science - Data Science

University of Colorado - Boulder
Boulder, CO
12.2026

Bachelor of Science - Chemistry, Biochemistry Concentration, Math Minor

UNC-Wilmington
Wilmington, NC
12.2014

Skills

  • Programming & Tools: Python (Pandas, NumPy, Scikit-learn), SQL, Git, Jupyter Notebooks
  • Machine Learning & AI: Supervised & Unsupervised Learning, Model Evaluation, Bias Detection, Data Drift Analysis
  • Data Processing & QC: Data Cleaning, Feature Engineering, ETL, Data Preprocessing, Anomaly Detection, Data Filtering, Data Formatting, Data Operations
  • Data Visualization: Power BI, Tableau, Matplotlib, Seaborn, Visual Data Representation
  • Quality Control & Defect Management: Defect Tracking
  • Cloud & Deployment: AWS, Streamlit, Flask
  • Software & Systems: Linux, REST APIs, LabVantage LIMS

Certification

  • Expressway to Data Science: Essential Math Specialization – University of Colorado Boulder (2025)
  • Machine Learning for All – University of London (2024)
  • Generative AI for Data Scientists – IBM (2024)
  • Google Data Analytics Professional Certificate – Google (2023)
  • R Programming in Data Science: High Variety Data – LinkedIn (2023)
  • Cleaning Bad Data in R – (2023)
  • [Pursuing] IBM Data Science Professional Certificate – IBM (2025)

Projects

Molecular Dynamics Study of TP10 Peptide | Computational Modeling
  • Conducted molecular dynamics simulations to analyze peptide binding, folding, and insertion into membranes.
  • Utilized computational modeling techniques to predict peptide behavior in zwitterionic environments.
  • Processed and visualized data using GROMACS, molecular dynamics tools, and statistical analysis.
  • Co-authored a peer-reviewed publication based on the research findings.
AI Data Cleaning Assistant | Python, Streamlit, OpenAI API
  • Built a Streamlit-powered AI assistant that analyzes and suggests cleaning techniques for messy datasets.
  • Integrated GPT-4 API to provide data validation recommendations for missing values, duplicates, and inconsistencies.
  • Automated data preprocessing tasks to improve dataset quality for machine learning models.
AI Bias & Model Drift Detector | Python, Pandas, Scikit-learn
  • Developed a bias detection tool for AI datasets, identifying imbalances in training data.
  • Implemented statistical methods to detect model drift in AI predictions.
  • Used automated anomaly detection to flag unreliable AI outputs.
Customer Churn Prediction | Machine Learning Project
  • Created a predictive model using logistic regression, decision trees, and gradient boosting.
  • Feature engineered datasets to optimize model accuracy for a telecom company.

Timeline

Data Analyst | AI Quality Control & Data Validation

Metabolon
06.2024 - Current

Data Analyst | Biological Development & AI QC

Metabolon
05.2023 - 06.2024

Data Analyst | Data Curation & AI Model Validation

Metabolon
01.2022 - 05.2023

Supervisor - R&D Coordinator, Data Curation

Metabolon
01.2021 - 12.2021

Research Scientist | Data Curation & AI Training Data QC

Metabolon
01.2017 - 12.2020

Assistant Scientist | Data Curation

Metabolon
08.2015 - 12.2016

Master of Science - Data Science

University of Colorado - Boulder

Bachelor of Science - Chemistry, Biochemistry Concentration, Math Minor

UNC-Wilmington
Kristen Cranford