Mazhar Hossain

Python backend and data engineer building reliable data platforms, ML-ready pipelines & production APIs for analytics and forecasting.

Research & mHealth ML Systems Data Platforms
Milwaukee, WI Open to collaboration

6+ years

Industry experience

1k+

StackOverflow

200+

Scholar citations

Impact Highlights

  • Managed pipelines from 500+ data sources
  • Reduced debugging time by 95% via automated verification
  • Cut new-source onboarding effort by 60%
  • Improved incident response time by 35%

Profile

Python backend and data engineer with 6+ years of experience across data pipelines, ML systems & production APIs. Focused on data quality, observability & scalable ingestion frameworks. Research Assistant at Marquette University (mHealth), currently developing ML-enabled mHealth data collection platforms.

Data Integrity Monitoring Prediction Automation

Top Skills

  • Python & Backend Systems
  • ETL & Automation
  • Data Quality & Observability

0

Years in data & ML engineering

0

Data sources integrated

0

Stack Overflow reputation

SO

0

Google Scholar citations

Scholar

Experience

Research, data engineering & ML-focused roles.

Marquette University Research Assistant
Jan 2026 - Present · Milwaukee, WI

mHealth research, applied ML & data processing for health-focused studies.

  • Research on mHealth applications and applied ML.
  • Data processing and analysis for health-focused research studies.
ResearchMLHealthcare
Cefalo Software Engineer → Senior Software Engineer
Sep 2021 - Dec 2025 · Dhaka, Bangladesh

Data platform reliability, 500+ source pipelines & 95% faster debugging via automation.

  • Built energy market forecasting data products and RESTful backends.
  • Built RESTful services for urgent market data ingestion.
  • Managed pipelines from 500+ sources with high integrity and availability.
  • Automated verification processes, reducing debugging time by 95%.
  • Redesigned ingestion framework, cutting new-source effort by 60%.
  • Implemented monitoring for critical sources, reducing response time by 35%.
  • Contributed to data platform reliability and automation.
PythonETLMonitoringAPIsREST
Upwork Python Developer (Freelance)
Feb 2021 - Dec 2022 · Remote

ETL delivery, system stabilization & 100% job success rate.

  • Delivered ETL pipelines across diverse data sources.
  • Stabilized systems by fixing and optimizing existing scripts.
  • Produced clear documentation for workflows and installations.
  • Maintained 100% job success rate.
ETLAutomationDocumentation
BJIT AI Engineer
Oct 2019 - Aug 2021 · Dhaka, Bangladesh

Computer vision (74% detection accuracy), NLP chatbots (+43% accuracy) & recommender systems.

  • Improved dataset quality by 30% through augmentation and labeling.
  • Delivered drone small-object detection with 74% accuracy.
  • Improved chatbot response accuracy by 43% with model training and API integration.
  • Built recommender system using collaborative and content-based filtering.
PyTorchNLPComputer Vision

Education

M.Sc.

Marquette University

Master’s in Data Science · 2026 - 2027

Research Assistant (mHealth)

B.Sc.

Khulna University of Engineering & Technology

B.Sc. in ECE · 2014 - 2018

Electronics & Communication Engineering

Employing PCA and t-statistical approach for feature extraction and classification of emotion from multichannel EEG signal

Egyptian Informatics Journal · Nov 2, 2019

  • Hybrid PCA + t-statistics features for EEG emotion recognition on SEED.
  • Evaluated with SVM, ANN, LDA, k-NN across subject-dependent and independent setups.
  • Best accuracy: 86.57% (ANN, subject-dependent).

Skills

Python

Python

ETL, automation & production APIs.

PostgreSQL

PostgreSQL

Data modeling, optimization & reporting.

Django

Django

REST APIs and data-driven applications.

Docker

Docker

Containerized deployments and local dev flows.

Airflow ETL Web Scraping PyTorch Scikit-learn RASA OpenCV GitHub Actions Prometheus Grafana

Projects

Chatbot Platform

Intent- and entity-based chatbot system with dataset tooling, multi-profile support & Flask APIs.

RASAFlaskNLPCUDA

Small Object Detection

Trained YOLOv3 and Faster R-CNN pipelines for drones, birds, and underwater fish with performance reporting.

PyTorchOpenCVTensorBoard

Django Web App

Web app for server lookup by IP and content recommendations.

DjangoHTML/CSSREST

Scraping Solutions

Data extraction tools using multiple scraping frameworks with production-ready data cleaning.

BeautifulSoupSeleniumScrapy

Certifications

OWASP Top 10:2021 in Python

Secure CodingVulnerability AssessmentSecurity Testing

SQL for Data Science

Data ModelingDatabase Design Data AnalysisData Quality SQL

Deep Learning Specialization

Image AnalysisConvolution Neural Networks RNNApplied Machine Learning Transfer Learning

Machine Learning

Supervised LearningUnsupervised Learning Feature EngineeringModel Evaluation NumpyTensorFlow

Let’s Connect

I’m open to opportunities in data engineering, analytics & ML systems. Reach out for collaborations or consulting.