Mohammad Waqas

Computational Biologist & MD Candidate
Boston, US.

About

Computational biologist and MD Candidate with deep expertise in large-scale omics data analysis, specializing in the genetic architecture of Alzheimer’s disease. Drives impactful research by leveraging advanced AI/ML, genomic, and statistical methodologies across multi-terabyte datasets, optimizing workflows for 7x speed and 3x cost efficiency. Proven leader in methods development, technical collaboration, and end-to-end analytical ownership, translating complex data into population-scale insights.

Work

Massachusetts General Hospital
|

Senior Research Technician

Boston, MA, US

Summary

Spearheads advanced computational biology initiatives at Massachusetts General Hospital, leveraging multi-terabyte omics data to unravel the genetic architecture of Alzheimer's disease.

Highlights

Owned and executed analytic strategy for multi-terabyte omics and AI/ML programs (WGS, metabolomic, proteomic) across UK Biobank, All of Us, NIAGADS, and MGB Biobank, enabling consistent cross-cohort inference at population scale.

Architected a novel parallelized WGS GWAS workflow on DNAnexus cloud infrastructure, achieving a 7x wall-time reduction, 3x cost reduction, and 5x storage reduction relative to existing pipelines.

Led identification and mitigation of statistical and methodological risks in large-scale association studies, including population structure artifacts, confounding, and calibration failures.

Individually initiated and led a methods-development program outperforming field standards for ancestry correction (PCA), demonstrating improved GWAS calibration via nonlinear population structure modeling.

Developed reusable command-line tooling to batch-process hundreds of thousands of files with consistent transformations, logging, and failure recovery in Linux/HPC environments and cloud platforms (AWS-backed DNAnexus).

University of Alabama at Birmingham
|

Research Assistant

Birmingham, AL, US

Summary

Conducted neuroimaging research at the University of Alabama at Birmingham, focusing on functional MRI data analysis to map brain regions.

Highlights

Analyzed fMRI data to precisely map the primary somatosensory cortex, contributing to understanding brain function and neural pathways.

Utilized advanced computational tools and scripting (R, Python, bash) for efficient data processing and visualization in neuroimaging studies.

Executed end-to-end analysis lifecycles, from data ingestion to interpretation, for complex fMRI datasets.

Collaborated with principal investigators to design and implement experimental protocols for neuroimaging studies, ensuring data integrity and scientific rigor.

University of Alabama at Birmingham
|

Research Assistant

Birmingham, AL, US

Summary

Investigated the role of the BIN1 gene in modulating network hyperexcitability within Alzheimer's disease pathology at the University of Alabama at Birmingham.

Highlights

Conducted research to determine the impact of BIN1 on network hyperexcitability, a key pathological feature in Alzheimer's disease.

Utilized advanced laboratory techniques and data analysis to characterize gene function and its physiological effects.

Collaborated with a multidisciplinary team to advance understanding of neurodegenerative mechanisms and disease progression.

Presented findings at multiple research competitions and conferences, including the Ost Undergraduate Research Competition and the Comprehensive Neuroscience Center Retreat.

Harvard Medical School/Massachusetts General Hospital
|

Harvard-Amgen Scholar

Boston, MA, US

Summary

Contributed to a Harvard-Amgen research initiative, investigating genetic and sex-specific risk factors for Alzheimer's disease through large-scale data analysis.

Highlights

Analyzed large genomic datasets to identify novel genetic and sex-specific risk factors associated with Alzheimer's disease progression.

Executed comprehensive data analysis lifecycles, including data ingestion, quality control, and statistical modeling for genetic association studies.

Applied rigorous statistical diagnostics to ensure robustness and validity of findings in complex genetic analyses.

Presented research findings at the Harvard-Amgen Scholars Presentation, demonstrating strong scientific communication skills to a diverse audience.

University of Alabama at Birmingham
|

Research Assistant

Birmingham, AL, US

Summary

Performed genomic analysis of bacteriophages infecting Corynebacterium at the University of Alabama at Birmingham, focusing on identifying common repeat regions.

Highlights

Analyzed genomic data from bacteriophages to identify and characterize common repeat regions, contributing to microbiology research.

Applied computational methods for sequence analysis and data interpretation in a genomics context.

Participated in scientific discussions and presented findings at the UAB Spring 2018 Undergraduate Research Expo and NCUR 2018, enhancing scientific communication.

Education

University of South Alabama College of Medicine
Mobile, AL, United States of America

Doctor of Medicine (MD)

Medicine

University of Alabama at Birmingham
Birmingham, AL, United States of America

Bachelor of Science; Bachelor of Arts

Neuroscience; Philosophy

Awards

2nd Place, National Ethics Bowl Competition

Awarded By

APPE (Association for Practical and Professional Ethics)

Award-winning performance in a national applied ethics debate competition, demonstrating strong analytical and ethical reasoning skills.

Publications

Association Between Mosaic Loss of Y Chromosome and Atrial.

Published by

Journal of the American College of Cardiology.

Summary

Co-first author publication investigating the association between mosaic loss of Y chromosome and atrial conditions.

Matching Heterogeneous Cohorts by Projected Principal Components Reveals Two Novel Alzheimer's Disease-Associated Genes in the Hispanic Population.

Published by

Alzheimers Dement.

Summary

Publication identifying novel Alzheimer's disease-associated genes in the Hispanic population using principal component analysis for cohort matching.

Actions, Reasons, and the Unconscious.

Published by

Philosophy, Psychiatry, & Psychology.

Summary

Manuscript under review exploring the philosophical concepts of actions, reasons, and the unconscious in an interdisciplinary journal.

A PKCn missense mutation enhances Golgi-localized signaling and is associated with recessively inherited familial Alzheimer's disease.

Published by

Science Signaling.

Summary

Research identifying a specific PKCn missense mutation linked to enhanced Golgi-localized signaling and familial Alzheimer's disease.

Associations of APOE variants with sphingomyelin and cholesterol metabolites across the life-course in diverse populations.

Published by

Metabolomics.

Summary

Study investigating the associations between APOE variants and lipid metabolites across various populations and life stages.

Identification of 16 novel Alzheimer's disease susceptibility loci using multi-ancestry meta-analyses of clinical Alzheimer's disease and AD-by-proxy cases from four whole genome sequencing datasets.

Published by

Alzheimers Dement.

Summary

Co-first author publication identifying 16 novel Alzheimer's disease susceptibility loci through multi-ancestry meta-analyses of whole genome sequencing data.

Assessing polyomic risk to predict Alzheimer's disease using a machine learning model.

Published by

Alzheimers Dement.

Summary

Research assessing polyomic risk factors for Alzheimer's disease prediction using advanced machine learning models.

Alzheimer's disease risk gene BIN1 induces Tau-dependent network hyperexcitability.

Published by

eLife.

Summary

Publication detailing how the Alzheimer's disease risk gene BIN1 induces Tau-dependent network hyperexcitability.

Skills

Data Analysis & Statistics

Large-scale Omics Data Analysis, Multi-terabyte Data Processing, AI/ML Programs, WGS Analysis, Metabolomic Analysis, Proteomic Analysis, Cross-cohort Inference, Population Scale Analysis, Statistical Diagnostics, Model Reasoning, Population Structure Artifacts, Confounding vs. Collider Bias, GWAS Calibration, Ancestry Correction (PCA), Nonlinear Population Structure Modeling, Unsupervised Nonlinear Manifold Learning, Spectral Graph Theory, Markov Dynamics, Graph Laplacians, Kernel Methods, Manifold Geometry, Category Theory, fMRI Data Analysis, Genomic Sequence Analysis.

Computational Tools & Platforms

R, Python, Bash, Linux/HPC (SLURM), GPU-accelerated Workflows (NVIDIA), Cloud Platforms (AWS-backed DNAnexus), PLINK, Regenie, beftools, samtools, GenomeStudio, Illumina Platforms, Command-line Tooling.

Genomics & Omics

Whole Genome Sequencing (WGS), Genetic Architecture, Alzheimer's Disease Genetics, Biobank Data (UK Biobank, All of Us, NIAGADS, MGB Biobank), mLOY Detection, APOE Variants, Sphingomyelin, Cholesterol Metabolites, BIN1 Gene, Bacteriophages, Corynebacterium.

Workflow & Pipeline Development

Workflow Optimization, Production Scale Workflows, End-to-End Analytical Ownership, Reusable Command-line Tooling, Batch Processing, Data Ingestion, Quality Control (QC), Model Specification, Execution Diagnostics, Interpretation, Downstream Validation, Failure Recovery, Domain-Specific Pipeline Creation.

Leadership & Collaboration

Methods Development Leadership, Technical Leadership, Internal Workshops, Codebase Sharing, Mentoring, Cross-functional Collaboration, Stakeholder Engagement.

Research & Scientific Method

Scientific Investigation, Experimental Design, Data Interpretation, Problem-Solving, Reproducible Workflows, Scientific Writing, Peer Review, Research Presentations.

Ethics & Philosophy

Applied Ethics, Moral Philosophy, Bioethics, Philosophy of Action.

Interests

Bioethics

Applied Ethics, Moral Philosophy, Neuroethics.