Pentaho Data Integration Fundamentals Training Course
Pentaho Data Integration is an open-source data integration tool for defining jobs and data transformations.
In this instructor-led, live training, participants will learn how to use Pentaho Data Integration's powerful ETL capabilities and rich GUI to manage an entire big data lifecycle and maximize the value of data within their organization.
By the end of this training, participants will be able to:
- Create, preview, and run basic data transformations containing steps and hops
- Configure and secure the Pentaho Enterprise Repository
- Harness disparate sources of data and generate a single, unified version of the truth in an analytics-ready format.
- Provide results to third-part applications for further processing
Audience
- Data Analyst
- ETL developers
Format of the course
- Part lecture, part discussion, exercises and heavy hands-on practice
Course Outline
Introduction
Installing and Configuring Pentaho
Overview of Pentaho Features and Architecture
Understanding Pentaho's In-Memory Caching
Navigating the User Interface
Connecting to a Data Source
Configuring the Pentaho Enterprise Repository
Transforming Data
Viewing the Transformation Results
Resolving Transformation Errors
Processing a Data Stream
Reusing Transformations
Scheduling Transformations
Securing Pentaho
Integrating with Third-party Applications (Hadoop, NoSQL, etc.)
Analytics and Reporting
Pentaho Design Patterns and Best Practices
Troubleshooting
Summary and Conclusion
Requirements
- An understanding of relational databases
- An understanding of data warehousing
- An understanding of ETL (Extract, Transform, Load) concepts
Open Training Courses require 5+ participants.
Pentaho Data Integration Fundamentals Training Course - Booking
Pentaho Data Integration Fundamentals Training Course - Enquiry
Pentaho Data Integration Fundamentals - Consultancy Enquiry
Consultancy Enquiry
Testimonials (2)
Very useful in because it helps me understand what we can do with the data in our context. It will also help me
Nicolas NEMORIN - Adecco Groupe France
Course - KNIME Analytics Platform for BI
It's a hands-on session.
Vorraluck Sarechuer - Total Access Communication Public Company Limited (dtac)
Course - Talend Open Studio for ESB
Upcoming Courses
Related Courses
KNIME Analytics Platform for BI
21 HoursKNIME Analytics Platform is a leading open source option for data-driven innovation, helping you discover the potential hidden in your data, mine for fresh insights, or predict new futures. With more than 1000 modules, hundreds of ready-to-run examples, a comprehensive range of integrated tools, and the widest choice of advanced algorithms available, KNIME Analytics Platform is the perfect toolbox for any data scientist and business analyst.
This course for KNIME Analytics Platform is an ideal opportunity for beginners, advanced users and KNIME experts to be introduced to KNIME, to learn how to use it more effectively, and how to create clear, comprehensive reports based on KNIME workflows
Platforma analityczna KNIME - szkolenie kompleksowe
35 HoursThe "Analytics Platform KNIME" training offers a comprehensive overview of this free data analytics platform. The program includes an introduction to data processing and analysis, installation and configuration KNIME, building workflow, methodology for creating business models and data modeling. The course also covers advanced data analysis tools, workflow import and export, tool integration, ETL processes, data mining, visualization, extensions and integrations with tools such as R, Java, Python, Gephi, Neo4j. The conclusion includes an overview of reporting, integration with BIRT and KNIME WebPortal.
Oracle GoldenGate
14 HoursThis instructor-led, live training in Norway (online or onsite) is aimed at sysadmins and developers who wish to set up, deploy, and manage Oracle GoldenGate for data transformation.
By the end of this training, participants will be able to:
- Install and configure Oracle GoldenGate.
- Understand Oracle databases replication using the Oracle GoldenGate tool.
- Understand the Oracle GoldenGate architecture.
- Configure and perform a database replication and migration.
- Optimize Oracle GoldenGate performance and troubleshoot issues.
Pentaho Open Source BI Suite Community Edition (CE)
28 HoursPentaho Open Source BI Suite Community Edition (CE) is a business intelligence package that provides data integration, reporting, dashboards, and load capabilities.
In this instructor-led, live training, participants will learn how to maximize the features of Pentaho Open Source BI Suite Community Edition (CE).
By the end of this training, participants will be able to:
- Install and configure Pentaho Open Source BI Suite Community Edition (CE)
- Understand the fundamentals of Pentaho CE tools and their features
- Build reports using Pentaho CE
- Integrate third party data into Pentaho CE
- Work with big data and analytics in Pentaho CE
Audience
- Programmers
- BI Developers
Format of the course
- Part lecture, part discussion, exercises and heavy hands-on practice
Note
- To request a customized training for this course, please contact us to arrange.
Sensor Fusion Algorithms
14 HoursSensor Fusion is the combination and integration of data from multiple sensors to provide a more accurate, reliable and contextual view of data.
Sensor Fusion implementations require algorithms to filter and integrate different data sources.
Audience
This course is targeted at engineers, programmers and architects who deal with multi-sensor implementations.
Talend Administration Center (TAC)
14 HoursThis instructor-led, live training in Norway (online or onsite) is aimed at system administrators, data scientists, and business analysts who wish to set up Talend Administration Center to deploy and manage the organization's roles and tasks.
By the end of this training, participants will be able to:
- Install and configure Talend Administration Center.
- Understand and implement Talend management fundamentals.
- Build, deploy, and run business projects or tasks in Talend.
- Monitor the security of datasets and develop business routines based on the TAC framework.
- Obtain a broader comprehension of big data applications.
Talend Big Data Integration
28 HoursThis instructor-led, live training in Norway (online or onsite) is aimed at technical persons who wish to deploy Talend Open Studio for Big Data to simplifying the process of reading and crunching through Big Data.
By the end of this training, participants will be able to:
- Install and configure Talend Open Studio for Big Data.
- Connect with Big Data systems such as Cloudera, HortonWorks, MapR, Amazon EMR and Apache.
- Understand and set up Open Studio's big data components and connectors.
- Configure parameters to automatically generate MapReduce code.
- Use Open Studio's drag-and-drop interface to run Hadoop jobs.
- Prototype big data pipelines.
- Automate big data integration projects.
Talend Cloud
7 HoursThis instructor-led, live training in Norway (online or onsite) is aimed at data administrators and developers who wish to manage, monitor, and operate data integration processes using Talend Cloud services.
By the end of this training, participants will be able to:
- Navigate the Talend Management Console to manage users and roles in the platform.
- Evaluate data to find and understand relevant datasets.
- Create a pipeline to process and monitor data at rest or in action.
- Prepare data for analysis to generate insights relevant to the business.
Talend Data Stewardship
14 HoursThis instructor-led, live training in Norway (online or onsite) is aimed at beginner to intermediate-level data analysts who wish to deepen their understanding and skills in managing and improving data quality using Talend Data Stewardship.
By the end of this training, participants will be able to:
- Gain a comprehensive understanding of the role of data stewardship in maintaining data quality.
- Use Talend Data Stewardship for managing data quality tasks.
- Create, assign, and manage tasks within Talend Data Stewardship, including workflow customization.
- Use the tool's reporting and monitoring capabilities to track data quality and stewardship efforts.
Talend Open Studio for ESB
21 HoursIn this instructor-led, live training in Norway, participants will learn how to use Talend Open Studio for ESB to create, connect, mediate and manage services and their interactions.
By the end of this training, participants will be able to
- Integrate, enhance and deliver ESB technologies as single packages in a variety of deployment environments.
- Understand and utilize Talend Open Studio's most used components.
- Integrate any application, database, API, or Web services.
- Seamlessly integrate heterogeneous systems and applications.
- Embed existing Java code libraries to extend projects.
- Leverage community components and code to extend projects.
- Rapidly integrate systems, applications and data sources within a drag-and-drop Eclipse environment.
- Reduce development time and maintenance costs by generating optimized, reusable code.