Nairobi, Kenya

0728 269396

Serverless Data Engineering Training Course

In today’s cloud-native era, Serverless Data Engineering is revolutionizing how organizations build and manage data workflows. By eliminating the need to provision and manage infrastructure, ser...

Click to Register

ONSITE OR VIRTUAL

May 04 - May 08
Programme Overview
Training Description

Who Should Attend

This course is ideal for;

  1. Cloud Data Engineers
  2. DevOps Engineers working with data infrastructure
  3. Big Data Developers
  4. Cloud Architects
  5. DataOps and MLOps Professionals
  6. Technical Project Managers
  7. Enterprise Data Platform Engineers
  8. Analytics Engineers 
Session Objectives
  • Understand the fundamentals of serverless computing and its benefits for data engineering
  • Learn how to design serverless data pipelines for batch and streaming data
  • Explore integration with cloud-native services for data ingestion, transformation, and output
  • Build event-driven architectures that respond to data triggers efficiently
  • Master monitoring, logging, and alerting in serverless environments
  • Apply cost optimization strategies for serverless workflows
  • Ensure scalability, fault-tolerance, and reliability in serverless pipelines
  • Implement real-time analytics and data lake ingestion
  • Enforce security and governance across serverless components
  • Use infrastructure as code to manage and automate deployments
  • Gain hands-on experience through practical labs and a capstone project
About the Course

In today’s cloud-native era, Serverless Data Engineering is revolutionizing how organizations build and manage data workflows. By eliminating the need to provision and manage infrastructure, serverless architectures empower data engineers to focus on designing highly scalable, resilient, and cost-efficient pipelines. This course equips participants with the skills to build event-driven data processing systems using serverless technologies such as AWS Lambda, Azure Functions, Google Cloud Functions, and more. Learners will explore the best practices for orchestration, data ingestion, real-time processing, monitoring, and governance in a serverless environment, enabling them to accelerate innovation and reduce operational complexity.

Curriculum & Topics

15 Topics | 10 Days

  • play Subtopic 1.1: Overview of serverless computing

  • play Subtopic 1.2: Benefits and challenges for data processing

  • play Subtopic 1.3: Key cloud provider offerings: AWS, GCP, Azure

  • play Subtopic 1.4: Serverless vs. container-based architectures

  • play Subtopic 1.5: Use cases in modern data engineering

  • play Subtopic 2.1: AWS Lambda, Azure Functions, Google Cloud Functions

  • play Subtopic 2.2: Function lifecycle and execution model

  • play Subtopic 2.3: Writing and deploying serverless functions

  • play Subtopic 2.4: Managing concurrency and limits

  • play Subtopic 2.5: Using frameworks like Serverless Framework and SAM

  • play Subtopic 3.1: Sources of events: file drops, API calls, queues

  • play Subtopic 3.2: Triggering pipelines on data events

  • play Subtopic 3.3: Designing event producers and consumers

  • play Subtopic 3.4: Ensuring idempotency and retry strategies

  • play Subtopic 3.5: Chaining and fan-out patterns

  • play Subtopic 4.1: Using Kinesis, Pub/Sub, and Event Hubs

  • play Subtopic 4.2: Handling structured and unstructured data

  • play Subtopic 4.3: Real-time vs. batch ingestion

  • play Subtopic 4.4: Validating and transforming incoming data

  • play Subtopic 4.5: Integrating with APIs and third-party data sources

  • play Subtopic 5.1: Using Amazon S3, Azure Blob, and GCS

  • play Subtopic 5.2: Object lifecycle management and versioning

  • play Subtopic 5.3: Event notifications on file uploads

  • play Subtopic 5.4: Data partitioning and organization

  • play Subtopic 5.5: Storage security and access policies

  • play Subtopic 6.1: Stream processing with Lambda and Kinesis

  • play Subtopic 6.2: Aggregation and windowing techniques

  • play Subtopic 6.3: Handling late-arriving and out-of-order data

  • play Subtopic 6.4: Delivering processed results to sinks

  • play Subtopic 6.5: Combining stream and batch processing

  • play Subtopic 7.1: AWS Step Functions, Azure Durable Functions

  • play Subtopic 7.2: Defining state machines and workflows

  • play Subtopic 7.3: Error handling and retries in orchestration

  • play Subtopic 7.4: Chaining multi-step pipelines

  • play Subtopic 7.5: Visualizing and monitoring execution

  • play Subtopic 8.1: Querying with Athena, BigQuery, Synapse Serverless

  • play Subtopic 8.2: ETL and ELT design in serverless context

  • play Subtopic 8.3: Leveraging Glue and Dataflow for transformation

  • play Subtopic 8.4: Schema inference and metadata cataloging

  • play Subtopic 8.5: Using PySpark and SQL for data prep

  • play Subtopic 9.1: Logging with CloudWatch, Stackdriver, Azure Monitor

  • play Subtopic 9.2: Tracing and profiling functions

  • play Subtopic 9.3: Creating metrics and dashboards

  • play Subtopic 9.4: Handling cold starts and latency issues

  • play Subtopic 9.5: Alerting and anomaly detection

  • play Subtopic 10.1: Understanding billing for serverless workloads

  • play Subtopic 10.2: Reducing invocations and execution time

  • play Subtopic 10.3: Managing data transfer costs

  • play Subtopic 10.4: Setting budgets and usage alerts

  • play Subtopic 10.5: Comparing serverless vs. managed alternatives

  • play Subtopic 11.1: IAM policies for least privilege

  • play Subtopic 11.2: Securing secrets and API keys

  • play Subtopic 11.3: Encrypting data at rest and in transit

  • play Subtopic 11.4: Managing authentication and authorization

  • play Subtopic 11.5: Reviewing serverless security best practices

  • play Subtopic 12.1: Using Terraform and CloudFormation

  • play Subtopic 12.2: Creating reproducible deployments

  • play Subtopic 12.3: Versioning infrastructure and code

  • play Subtopic 12.4: Managing environments and secrets

  • play Subtopic 12.5: Automated testing and validation

  • play Subtopic 13.1: Creating APIs using API Gateway and Lambda

  • play Subtopic 13.2: Designing REST and GraphQL endpoints

  • play Subtopic 13.3: Rate limiting and throttling

  • play Subtopic 13.4: Integrating with data stores

  • play Subtopic 13.5: Securing APIs with OAuth and tokens

  • play Subtopic 14.1: Deploying lightweight models with serverless

  • play Subtopic 14.2: Triggering predictions on data events

  • play Subtopic 14.3: Integrating with SageMaker, Vertex AI, ML.NET

  • play Subtopic 14.4: Streaming inference vs. batch inference

  • play Subtopic 14.5: Scaling ML workloads with autoscaling functions

  • play Subtopic 15.1: Building a full serverless data pipeline

  • play Subtopic 15.2: Real-world case studies from e-commerce and finance

  • play Subtopic 15.3: End-to-end implementation and demo

  • play Subtopic 15.4: Troubleshooting and final optimization

  • play Subtopic 15.5: Presentation and feedback session

img

$ 3,000

Availability Calendar

Find a schedule that works for you. Click any available session to submit a booking.

Selected Session:
Delivery modes & Locations
This Programme Includes

Certificate of completion

Training manual

Reference materials

10 o'clock tea

Lunch

4 o'clock tea

Course Highlights
  • icon 10 Days Intensive Training

  • icon 15 Core Learning Topics

  • icon 10 Days Professional Sessions

  • icon Training Expert-led Delivery

PB Training Institute of Research and Consultancy
FAQs

Frequently Asked Questions

Explore detailed answers to the most common questions about our platform and services.

No questions available at the moment.