Skip to content

trick-1/redback-data-warehouse

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

redback-data-warehouse

Data Warehouse storage of code and configurations

Garmin Run Data – ETL Pipeline Update

This ETL pipeline processes Garmin_run_data.csv and includes:

Data cleaning:

  • Removes duplicate rows
  • Standardizes column names (lowercase, underscores)
  • Converts timestamps to datetime
  • Fills missing numeric values with column means
  • Removes outliers in heart_rate (keeps values between 30–220 bpm)
  • Converts distance from meters to kilometers
  • Converts speed from m/s to km/h

Data aggregation:

  • Groups data by year and week
  • Calculates total runs, total distance (km), average speed (km/h), and average pace (min/km) per week

Outputs:

  • cleaned_garmin_run_data.csv → cleaned dataset

About

Data Warehouse storage of code and configurations

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 53.4%
  • Jupyter Notebook 40.8%
  • HTML 2.8%
  • CSS 1.2%
  • Dockerfile 1.1%
  • PHP 0.5%
  • Shell 0.2%