Nairobi, Kenya

0728 269396

Master Distributed Databases For Real-time, Scalable Applications

ONSITE OR VIRTUAL

May 04 - May 08
Programme Overview
Who Should Attend
This course is designed for:
  1. Data Engineers building scalable data infrastructure
  2. Database Administrators transitioning to NoSQL platforms
  3. Backend Developers implementing NoSQL-driven applications
  4. DevOps Engineers managing distributed systems
  5. Cloud Engineers deploying NoSQL services at scale
  6. Data Architects designing flexible and performant schemas
  7. ETL Developers optimizing data ingestion into NoSQL systems
  8. Full-Stack Developers integrating NoSQL into backend APIs

Session Objectives
  • Understand the core principles of NoSQL database types and architectures
  • Gain deep hands-on experience with MongoDB and Apache Cassandra
  • Design high-performance NoSQL schemas for real-world applications
  • Implement advanced indexing, sharding, and replication strategies
  • Integrate NoSQL databases with modern data pipelines
  • Optimize NoSQL query performance and throughput
  • Ensure data consistency and availability across distributed clusters
  • Monitor and maintain large-scale NoSQL infrastructure
  • Secure NoSQL deployments with proper access control and encryption
  • Compare and evaluate NoSQL options for specific business needs
  • Build resilient, cloud-native NoSQL-powered systems
About the Course

As data engineering increasingly demands handling diverse, large-scale, and high-velocity data, mastering NoSQL databases becomes crucial. This Advanced NoSQL for Data Engineers (MongoDB, Cassandra, etc.) training course gives participants practical, in-depth expertise to design, optimize, and manage the distributed NoSQL systems used in modern big data pipelines. Through a hands-on approach, participants will explore schema design, indexing, querying, replication, sharding, and performance tuning on leading NoSQL platforms such as MongoDB, Apache Cassandra, and Redis. The course is ideal for professionals building real-time analytics platforms, microservices backends, or IoT-scale infrastructure, and prepares engineers to use NoSQL technologies for scalability, flexibility, and high availability.

General Notes
  1. This course will be delivered by our experienced experts and professionals in data analysis. The workshop will be highly interactive.
  2. Training manuals and additional reference materials are provided to the participants.
  3. Upon successful completion of this course, participants will be issued with a certificate.
  4. The training will be conducted by PB Institute of Research and Technology.
  5. The training fee covers tuition, training materials, lunch, and the training venue. Accommodation and airport transfers are arranged for participants upon request.
Curriculum & Topics

15 Topics | 10 Days

  • Subtopic 1.1: Understanding the evolution from relational to NoSQL databases
  • Subtopic 1.2: Classification: key-value, document, wide-column, and graph databases
  • Subtopic 1.3: Use cases and advantages of NoSQL over traditional RDBMS
  • Subtopic 1.4: CAP theorem and its implications on NoSQL systems
  • Subtopic 1.5: Overview of popular NoSQL platforms and ecosystems

  • Subtopic 2.1: Document data model and BSON structure
  • Subtopic 2.2: CRUD operations with MongoDB shell and drivers
  • Subtopic 2.3: Indexing strategies for performance optimization
  • Subtopic 2.4: Schema design best practices for flexible structures
  • Subtopic 2.5: Aggregation framework and pipeline patterns

  • Subtopic 3.1: Replication and replica set configuration
  • Subtopic 3.2: Sharding and horizontal scaling techniques
  • Subtopic 3.3: Transactions and ACID compliance in MongoDB
  • Subtopic 3.4: Backup, restore, and disaster recovery planning
  • Subtopic 3.5: Performance tuning and profiling queries

  • Subtopic 4.1: Introduction to wide-column data models
  • Subtopic 4.2: Understanding Cassandra architecture and write path
  • Subtopic 4.3: Keyspaces, tables, and CQL (Cassandra Query Language)
  • Subtopic 4.4: Partitions and clustering for distributed data
  • Subtopic 4.5: Data modeling patterns for time series and events

  • Subtopic 5.1: Replication strategies and consistency levels
  • Subtopic 5.2: Read/write performance optimization
  • Subtopic 5.3: Compaction, caching, and garbage collection settings
  • Subtopic 5.4: Using nodetool and cqlsh for cluster management
  • Subtopic 5.5: Monitoring metrics and alerts in production

  • Subtopic 6.1: Key-value store concepts with Redis
  • Subtopic 6.2: Use cases: caching, pub/sub, real-time counters
  • Subtopic 6.3: Data structures: sets, lists, sorted sets, hashes
  • Subtopic 6.4: Persistence options and memory optimization
  • Subtopic 6.5: Redis Cluster and Sentinel configuration

  • Subtopic 7.1: Designing for reads vs writes in NoSQL systems
  • Subtopic 7.2: Denormalization and embedded document strategies
  • Subtopic 7.3: Modeling one-to-many and many-to-many relationships
  • Subtopic 7.4: Choosing partition keys and avoiding hotspots
  • Subtopic 7.5: Trade-offs between flexibility and consistency

  • Subtopic 8.1: Indexing techniques and query planners
  • Subtopic 8.2: Aggregation tuning and pipeline optimization
  • Subtopic 8.3: Query profiling tools in MongoDB and Cassandra
  • Subtopic 8.4: Latency reduction and throughput scaling
  • Subtopic 8.5: Identifying and resolving anti-patterns

  • Subtopic 9.1: Authentication and role-based access in MongoDB
  • Subtopic 9.2: Secure client connections using TLS/SSL
  • Subtopic 9.3: Auditing and activity logging
  • Subtopic 9.4: Data encryption at rest and in transit
  • Subtopic 9.5: Security hardening of NoSQL clusters

  • Subtopic 10.1: Snapshot-based backup strategies
  • Subtopic 10.2: Point-in-time recovery techniques
  • Subtopic 10.3: Cluster failover and leader election
  • Subtopic 10.4: Data migration across environments
  • Subtopic 10.5: Ensuring uptime with distributed replication

  • Subtopic 11.1: Connecting NoSQL systems with Apache Kafka and Spark
  • Subtopic 11.2: Streaming data ingestion from microservices
  • Subtopic 11.3: ETL workflows with NoSQL as sink or source
  • Subtopic 11.4: Data enrichment and transformation patterns
  • Subtopic 11.5: Real-time analytics architecture

  • Subtopic 12.1: Prometheus and Grafana for MongoDB/Cassandra metrics
  • Subtopic 12.2: Query performance dashboards and latency tracking
  • Subtopic 12.3: Disk I/O, memory, and CPU usage monitoring
  • Subtopic 12.4: Alerting strategies for cluster health
  • Subtopic 12.5: Log aggregation and analysis

  • Subtopic 13.1: Managed services (MongoDB Atlas, Amazon Keyspaces)
  • Subtopic 13.2: Infrastructure as Code for provisioning clusters
  • Subtopic 13.3: Auto-scaling and load balancing
  • Subtopic 13.4: Cost optimization strategies in cloud environments
  • Subtopic 13.5: Multi-region replication and latency considerations

  • Subtopic 14.1: Benchmarking MongoDB vs Cassandra vs Redis
  • Subtopic 14.2: Selecting the right NoSQL database by workload
  • Subtopic 14.3: Hybrid architectures with SQL + NoSQL
  • Subtopic 14.4: Polyglot persistence strategies
  • Subtopic 14.5: Business case evaluation

  • Subtopic 15.1: Designing a high-throughput document and key-value data store
  • Subtopic 15.2: Implementing schema, indexing, and access controls
  • Subtopic 15.3: Integrating the system with real-time ingestion pipelines
  • Subtopic 15.4: Monitoring, securing, and deploying the solution
  • Subtopic 15.5: Presenting project outcomes and scalability plans

$3,000

This Programme Includes

Certificate of completion

Training manual

Reference materials

10 o'clock tea

Lunch

4 o'clock tea

Course Highlights
  • 10 Days Intensive Training
  • 15 Core Learning Topics
  • 10 Days Professional Sessions
  • Expert-led Training Delivery

PB Training Institute of Research and Consultancy