Taseen Hossain
Data Science & Analytics

Skilled in statistical programming, clinical data analysis, and health analytics — with hands-on experience in SAS, Python, SQL, and Tableau. Focused on turning structured data into reliable, evidence-based insights.

Projects

SAS · Clinical Data

Heart Disease Risk Stratification

Built a full SAS reporting pipeline to generate baseline demographics and stratified cardiovascular risk summaries from a 5,209-record clinical dataset. Cleaned and validated data, recoded missing categorical variables, and automated professional clinical-style PDF reports using ODS.

SAS DATA Step · PROC MEANS · PROC FREQ · PROC SQL · ODS PDF

IBM Capstone

Technology Trends Analysis

Performed end-to-end data wrangling and exploratory analysis to identify emerging trends in programming languages, databases, and frameworks. Built interactive dashboards and presented findings through data storytelling techniques.

Python · Pandas · SQL · Cognos Analytics · Excel

Machine Learning

Predicting Wine Quality

Preprocessed the Kaggle Wine Quality dataset using PCA and K-means clustering. Developed and evaluated a Decision Tree classifier, which identified alcohol content as the strongest quality predictor, and visualized the results with confusion matrices and scatterplots.

scikit-learn · Pandas · NumPy · Matplotlib · PCA

Experience

Indiana University High School · Remote
Apr 2025 – Present

Web Developer (Part-Time)

  • Build and maintain multiple online courses using HTML/CSS and Canvas LMS, ensuring content accuracy and consistency across projects.
  • Conduct accessibility audits and quality checks to meet WCAG compliance standards and improve data integrity.
  • Collaborate with academic staff to review and implement feedback, aligning materials with learning objectives.

Bambuddha Group · Sydney, Australia
Jun – Aug 2024

Data & Market Analysis Intern

  • Conducted competitor research and market analysis to identify business trends and inform strategic recommendations.
  • Structured and maintained research datasets in Excel, applying data organization practices to improve accessibility and usability for the team.
  • Supported SEO strategy development by analyzing web performance data and surfacing actionable insights to improve search visibility.

Indiana University · Bloomington, IN
Feb – May 2023

Undergraduate Mentor — Informatics 101

  • Taught Python, Excel, Google Data Studio, and HTML/CSS to introductory students across data analysis and visualization topics.
  • Provided individualized technical support and simplified complex concepts for non-technical audiences.

About Me

  • Pursuing a B.S. + M.S. in Data Science through an accelerated 4+1 program at Indiana University
  • SAS programming — clinical data cleaning, audit trails, and automated reporting
  • Python — EDA, statistical modeling, machine learning with Pandas, NumPy, scikit-learn
  • SQL, Tableau, IBM Cognos Analytics, and Excel for analysis and BI
  • Focused on clinical data analysis, health analytics, and statistical programming roles
  • IBM Certified Data Analyst
  • Phi Eta Sigma Honor Society

I have a strong foundation in data analysis, with experience working across Python, SQL, SAS, Excel, Tableau, and IBM Cognos Analytics to clean, analyze, and interpret data for real-world insights. Through academic projects and professional experience, I've worked on building structured datasets, performing data quality checks, and supporting data-driven decision-making.

My SAS experience includes DATA step processing, PROC MEANS, PROC FREQ, PROC SQL, and ODS PDF reporting, tools I've used to clean, audit, and generate reports on structured clinical-style datasets. One of my key projects was an end-to-end reporting pipeline that produced baseline demographics and stratified cardiovascular risk summaries from a 5,209-record clinical dataset.

With Python, I work primarily in Pandas, NumPy, Matplotlib, and scikit-learn — applying these across exploratory data analysis, statistical modeling, and machine learning workflows. I'm drawn to work at the intersection of data and health, particularly roles in clinical data analysis, health analytics, and statistical programming.

I am actively seeking opportunities where I can contribute to meaningful, evidence-based decisions while continuing to grow through my graduate studies.

Skills

SAS

  • DATA Step Processing
  • PROC MEANS
  • PROC FREQ
  • PROC SQL
  • ODS PDF Reporting
  • PROC SORT

Python & SQL

  • Pandas & NumPy
  • Matplotlib & Seaborn
  • scikit-learn
  • SQL (Joins, Subqueries)
  • R / RStudio

BI & Reporting

  • Tableau
  • IBM Cognos Analytics
  • Excel (PivotTables, VLOOKUP)
  • Google Data Studio

Other

  • HTML & CSS
  • Canvas LMS
  • WCAG Accessibility
  • SEO & Market Research

Education

Indiana University — Bloomington

Accelerated B.S. + M.S. in Data Science (4+1 Program)
Bloomington, IN

Relevant Coursework: Exploratory Data Analysis · Data Mining · Big Data Analytics · Applied Linear Models · Statistical Inference · Principles of Machine Learning

IBM Data Analyst Certified · Phi Eta Sigma Honor Society · Pre-APALSA Treasurer

Interested in connecting?

Open to opportunities in clinical data analysis, health analytics, and statistical programming. Feel free to reach out.