-
Course Objective 1 min
- Course Admin
-
Prerequisites and Supporting Content 1 min
-
Adding Platform Data Integration IPs to the Warehouse trusted list of IPs 3 min
-
Install the Platform ODBC Data Driver for Windows 7 min
-
Install the Platform ODBC Data Driver for Linux 4 min
-
Install the Platform Data Driver for Apple MAC 5 min
-
Jupyter Notebook: Data Preparation Iris Dataset 1 min
- Native Data Loading Functionality
-
Section Objective 1 min
- Data Integration: Load Files
-
Load a local data file 5 min
- Native Data Integration
-
Overview 6 min
-
Configuration Creation 5 min
-
Simple Mapper 6 min
-
Editing Existing Simple Mapper Maps 4 min
-
Using the Simple Mapper Expression Builder 6 min
-
Files 6 min
-
Jobs 3 min
-
Macros 6 min
-
Agents 4 min
-
Scheduling Jobs wit a CRON Expression 3 min
-
Job Log 3 min
- Native Data Integration Templates
-
GCP Storage to GCP Platform Warehouse 5 min
-
GCP Delimited Data File to GCP Platform Warehouse 4 min
-
NetSuite to a GCP Platform Warehouse 3 min
- DataConnect Integrations
-
Section Objective 1 min
- DataConnect: Design Integrations on Premise, Deploy and Run in Avalanche Connect
-
Create an Integration using the DataConnect Process Wizard 6 min
-
Process and Map Views 2 min
-
Define Macros and Create a Package 5 min
-
Deploy package in the Cloud and Run the Configuration 3 min
-
Update a Package, Deploy and Run the Updated Package 7 min
- DataConnect Data Profiler
-
Introduction to Data Profiler 5 min
- Ingesting Data using External Table Functionality
-
Overview 2 min
-
Load data from a Microsoft Azure Blob Store 5 min
-
Load data (CSV and Parquet) from an Amazon S3 Bucket 7 min
-
Load data from a GCP Bucket using Spark via the External Table SQL Statement 5 min
- Data Preparation using the Iris Dataset
-
Create a Connection and Supporting Functions 3 min
-
Create a Table 2 min
-
Loading data into a Platform Warehouse table 7 min
-
Copying and Enlarging the Data Set 2 min
- Optional Content
-
Six Essential Data Preparation Steps for Analytics 1 min
- Feedback
-
Take Course Survey
Actian Data Platform for Data Scientists: Data Preparation
The training provided within this course provides you with knowledge for the various native Platform capabilities to ingest, clean and transform data and access it all in a single data platform.
Course Outcome:
You will understand how simple it is to clean, transform and ingest data into a Platform Warehouse using the wide range of native capabilities provided as part of the Actian Data Platform.
Course Style:
The course is provided in a step-by-step fashion to ensure you understand the process to clean and transform and ingest data into Actian Data Platform.
Audience:
For Data Scientists and Data Architects, and similar personas in the organization responsible for business intelligence, analytics, and data visualization.
Prerequisites:
- No knowledge of the Actian Data Platform is required
- Some knowledge of the data science workflow is assumed
- Actian Data Platform login credentials and access to the Warehouse data are assumed to be available to you
Supplementary Resources:
Product Documentation:
|
|
Actian Community:
|
|
Software Download:
|