As promised in my last post after passing the DP-900 Azure Data Fundamentals exam here's my exam preparation guide which provide links to the various resources I used in studying for the different topics.
DESCRIBE CORE DATA CONCEPTS (15-20%)
Describe types of core data workloads
Describe batch data:
https://www.talend.com/resources/batch-processing/
https://docs.microsoft.com/en-us/azure/architecture/data-guide/big-data/batch-processing
Describe streaming data:
https://hazelcast.com/glossary/stream-processing/
Describe the difference between batch and streaming data:
https://docs.microsoft.com/en-us/learn/modules/explore-core-data-concepts/4-describe-difference
Describe the characteristics of relational data:
https://docs.microsoft.com/en-us/learn/modules/explore-core-data-concepts/3-identify-types-storage
Describe data analytics core concepts
Describe data
visualization (e.g., visualization, reporting, business intelligence (BI)):
https://powerbi.microsoft.com/en-us/data-visualization/
Describe basic chart
types such as bar charts and pie charts:
https://www.klipfolio.com/resources/articles/what-is-data-visualization
Describe analytics techniques (e.g., descriptive, diagnostic, predictive, prescriptive, cognitive):
https://docs.microsoft.com/en-us/learn/modules/data-analytics-microsoft/2-data-analysis
https://www.weirdgeek.com/2018/11/types-of-analytics/
Describe ELT and ETL
processing:
https://www.talend.com/resources/elt-vs-etl/
Describe the concepts
of data processing;
https://www.mbaknol.com/management-information-systems/concept-of-data-processing/
DESCRIBE HOW TO WORK WITH RELATION DATA ON AZURE (25-30%)
Describe relational data workloads
Identify the right
data offering for a relational workload:
https://docs.microsoft.com/en-us/azure/architecture/guide/technology-choices/data-store-overview
Describe relational
data structures (e.g., tables, index, views):
https://web.csulb.edu/colleges/coe/cecs/dbdesign/dbdesign.php?page=sql/views.php
Describe relational Azure data services
Describe and compare
PaaS, IaaS, and SaaS delivery models:
https://www.bmc.com/blogs/saas-vs-paas-vs-iaas-whats-the-difference-and-how-to-choose/
https://docs.microsoft.com/en-us/learn/modules/principles-cloud-computing/5-types-of-cloud-services
Describe Azure SQL
Database:
https://azure.microsoft.com/en-gb/services/sql-database/
Describe Azure
Synapse Analytics:
https://azure.microsoft.com/en-gb/services/synapse-analytics/
Describe SQL Server
on Azure Virtual Machine
Describe Azure Database for PostgreSQL, Azure Database for MariaDB, and Azure Database for MySQL:
Describe Azure SQL
Managed Instance:
https://docs.microsoft.com/en-us/azure/azure-sql/managed-instance/sql-managed-instance-paas-overview
Identify basic management tasks for relational data
Describe provisioning and deployment of relational data services:
Describe method for deployment including the Azure portal, Azure Resource Manager
templates, Azure PowerShell, and the Azure command-line interface (CLI):
https://docs.microsoft.com/en-us/learn/modules/deploy-azure-sql-database/3-deploy-single
Identify data security components (e.g., firewall, authentication):
Identify basic connectivity issues (e.g., accessing from on-premises, access with Azure VNets, access from Internet, authentication, firewalls):
https://docs.microsoft.com/en-us/azure/azure-sql/database/troubleshoot-common-errors-issues
https://medium.com/@vmehmeri/troubleshooting-connectivity-in-microsoft-azure-2035c7a43d3f
Identify query tools (e.g., Azure Data Studio, SQL Server Management Studio, sqlcmd utility, etc.):
https://docs.microsoft.com/en-us/sql/azure-data-studio/what-is?view=sql-server-ver15
https://docs.microsoft.com/en-us/sql/ssms/sql-server-management-studio-ssms?view=sql-server-ver15
https://docs.microsoft.com/en-us/sql/ssms/scripting/sqlcmd-use-the-utility?view=sql-server-ver15
https://code.visualstudio.com/
Describe query techniques for data using SQL language
Compare DDL versus
DML:
https://docs.microsoft.com/en-us/learn/modules/query-relational-data/2-introduction-to-sql
https://stackoverflow.com/questions/2578194/what-are-ddl-and-dml
Query relational data
in PostgreSQL, MySQL, and Azure SQL Database:
DESCRIBE HOW TO WORK WITH NON-RELATIONAL DATA ON AZURE (25-30%)
Describe non-relational data workloads
Describe the
characteristics of non-relational data:
Describe the types of
non-relational and NoSQL data:
https://azure.microsoft.com/en-gb/overview/nosql-database/
Recommend the correct
data store:
https://docs.microsoft.com/en-us/azure/architecture/data-guide/technology-choices/data-storage
Determine when to use
non-relational data:
https://azure.microsoft.com/en-gb/overview/nosql-database/
https://docs.microsoft.com/en-us/azure/architecture/data-guide/technology-choices/data-storage (capability matrices)
Describe non-relational data offerings on Azure
Identify Azure data
services for non-relational workloads:
https://docs.microsoft.com/en-us/learn/modules/explore-non-relational-data-offerings-azure/
https://docs.microsoft.com/en-us/azure/architecture/data-guide/technology-choices/data-storage
Describe Azure Cosmos
DB APIs:
https://docs.microsoft.com/en-us/learn/modules/choose-api-for-cosmos-db/
https://nordcloud.com/azure-cosmos-db/
Describe Azure Table
storage:
Describe Azure Blob
storage:
Describe Azure File
storage:
Identify basic management tasks for non-relational data
Describe provisioning
and deployment of non-relational data services:
Describe method for deployment including the Azure portal, Azure Resource Manager
templates, Azure PowerShell, and the Azure command-line interface (CLI):
Identify data
security components (e.g., firewall, authentication, encryption):
Identify basic connectivity issues (e.g., accessing from on-premises, access with Azure VNets, access from Internet, authentication, firewalls)
Identify management
tools for non-relational data:
https://docs.microsoft.com/en-us/learn/modules/explore-non-relational-data-stores-azure/
DESCRIBE AN ANALYTICS WORKLOAD ON AZURE (25-30%)
Describe analytics workloads
Describe
transactional workloads:
Describe the
difference between a transactional and an analytics workload:
https://www.guru99.com/oltp-vs-olap.html
Describe the
difference between batch and real time:
https://www.datasciencecentral.com/profiles/blogs/batch-vs-real-time-data-processing
Describe data
warehousing workloads:
https://docs.microsoft.com/en-us/learn/modules/explore-data-ingestion-azure/
Determine when a data
warehouse solution is needed:
Describe the components of a modern data warehouse
Describe Azure data services for modern data warehousing such as Azure Data Lake, Azure Synapse Analytics, Azure Databricks, and Azure HDInsight:
https://azure.microsoft.com/en-gb/solutions/data-lake/
https://azure.microsoft.com/en-gb/services/databricks/
Describe modern data
warehousing architecture and workload:
https://docs.microsoft.com/en-us/learn/modules/examine-components-of-modern-data-warehouse/
Describe data ingestion and processing on Azure
Describe common practices for data loading:
Describe the
components of Azure Data Factory (e.g., pipeline, activities, etc.):
https://www.cathrinewilhelmsen.net/2019/12/03/overview-azure-data-factory-components/
Describe data processing options (e.g., HDI, Azure Databricks, Azure Synapse Analytics, Azure Data Factory):
Describe data visualization in Microsoft Power BI
Describe the role of
paginated reporting:
Describe the role of
interactive reports:
https://powerbi.microsoft.com/en-us/desktop/
Describe the role of
dashboards:
Describe the workflow
in Power BI: