DataCamp - OnlineRetail dataset, Analysis #2 - SQL with Notebook
- Problem definition of E-Commerce Data
- Data Dictionary - explaining the content, the data types and the meaning of specific values or signs
- "Don't know where to start?" session defines Exploration, Analysis, and Visualisation challenges:
- Explore: Negative order quantities indicate returns. Which products have been returned the most?
- Visualize: Create a plot visualizing the profits earned from UK customers weekly, monthly.
- Analyze: Are order sizes from countries outside the United Kingdom significantly larger than orders from inside the United Kingdom?
With SQL visualization, and statistical analysis is not possible, but the data can be extracted for further analysis. DataCamp website in-built Notebook functionality can be of help in case of visualization, but t-probe statistics is still not doable on a pure SQL basis.
See the Python-based study or the Power BI solution on the same topic.
* Compatible Notebook readers/editors: DataCamp online Notebook (requires registration) Please note that Jupyter Notebook versions of any kind are not compatible automatically with any kind of SQL interpreters. There are ways (search online for solutions), but I recommend reading the pdf version provided above.
No comments:
Post a Comment