This project focuses on analyzing job offers' salary data using Power BI for visualization, combined with Talend for ETL processes. The dashboard provides valuable insights into salary distribution by job title, sector, location, and company size. The project is based on a star schema design, with a fact table for salaries and multiple dimension tables to describe jobs, locations, sectors, and company sizes.
The dashboard includes the following key visualizations and metrics:
-
Sum of Average Salaries: This card displays the total sum of the average salaries across all job offers.
-
Sum of Minimum Salaries: A card showing the total sum of the minimum salaries for the job offers analyzed.
-
Sum of Maximum Salaries: The total sum of the maximum salaries offered for the available positions.
-
Salary by Job Title: A bar chart showing the top 4 jobs with the highest average salaries, including positions like:
- Data Scientist
- Data Engineer
- Senior Data Scientist
- Senior Data Engineer
-
Salary by Sector: A bar chart visualizing the average, minimum, and maximum salaries across various sectors, including:
- Information Technology
- Biotech & Pharmaceuticals
- Business Services
- Finance, and more.
-
Salary by Location: A bar chart displaying the average salary distribution across different U.S. cities, with notable mentions like:
- San Francisco, CA
- New York, NY
- Cambridge, MA
-
Number of Job Offers by Company Type: A breakdown of job offers by the size of the company (large, medium, or small).
This project follows a star schema structure, where the fact table contains salary-related information and is linked to multiple dimension tables that categorize jobs, locations, sectors, and company sizes.
This table stores salary-related metrics:
avg_salary
: The average salary offered for the job.min_salary
: The minimum salary offered.max_salary
: The maximum salary offered.- Foreign keys to the dimension tables (Job, Location, Sector, Size).
projet_sid_dim_job
: Contains job titles.projet_sid_dim_location
: Describes job locations.projet_sid_dim_sector
: Holds sector information.projet_sid_dim_size
: Contains the company size classification (small, medium, large).
- Talend: Used for extracting, transforming, and loading (ETL) data into the final format.
- Power BI: For data visualization and dashboard creation.
- SQL: For database management and data preparation.
- Dataset: The raw data is related to job offers and salary information, classified by job title, location, sector, and company size.
- ETL Process: The data was cleaned and transformed using Talend before loading it into Power BI.
- Power BI Dashboard: The dashboard can be explored to uncover insights into salary distribution by various dimensions.
- Expand the dataset to include more job roles and industries.
- Add time-series analysis to monitor salary trends over time.
- Include filters for more granular data exploration, such as by specific cities or job categories.
Description: Job Offer Salary Analysis Dashboard Overview