Research Dataset Recommendation System
Matches research questions to datasets across 25+ sources; reduces search time from ~2 hours to <5 minutes.
// data-analytics · business-intelligence · behavioral-research · nlp
I build data pipelines, translate data into actionable business insights, conduct behavioral and product research, and apply NLP to real research problems.
A snapshot. View full proficiency breakdown →
Four fast reads for recruiters. Full list is on the Projects page.
Matches research questions to datasets across 25+ sources; reduces search time from ~2 hours to <5 minutes.
Medallion-layered pipeline (raw/cur/meta) for Azure SQL with watermark-based incremental loads and monitoring.
Analyzes retraction-flag consistency and tracks indexing drift (208 DOIs) impacting research integrity.
End-to-end fab operations pipeline with 4-layer medallion architecture, YAML rule engine (15+ validation checks), watermark-based incremental ingestion, and yield/equipment health analytics.