Streaming Data Pipelines Virtual Internship
In this advanced virtual internship, students will learn to build real-time data processing pipelines using stream processing technologies like Apache Kafka, Apache Spark Streaming, and Amazon Kinesis. They will gain hands-on experience in designing, implementing, and deploying scalable and fault-tolerant data pipelines that can handle high-velocity, high-volume data streams. The program will cover topics such as stream processing concepts, event-driven architecture, data ingestion, transformation, and analytics, equipping students with the skills to thrive in the fast-paced world of big data and real-time analytics.
Track Overview
Tasks & Milestones
Comparing Batch and Stream Processing
AdvancedAnalyze the differences between batch and stream processing, and identify use cases where each approach is more suitable.
Exploring Stream Processing Architectures
AdvancedInvestigate the key components and architectural patterns of stream processing systems.
Implementing a Kafka Producer and Consumer
AdvancedBuild a simple application that produces and consumes data using the Apache Kafka API.
Deploying a Kafka Cluster
AdvancedConfigure and deploy a Kafka cluster for a production environment.
Building a Spark Streaming Application
AdvancedDevelop a Spark Streaming application that processes real-time data from a Kafka topic.
Optimizing Spark Streaming Performance
AdvancedAnalyze and optimize the performance of a Spark Streaming application.
Implementing a Kinesis Data Pipeline
AdvancedBuild a real-time data processing pipeline using Amazon Kinesis Data Streams and Kinesis Data Firehose.
Integrating Kinesis with Other AWS Services
AdvancedExtend the Kinesis-based data pipeline by integrating it with other AWS services for end-to-end data processing.
Prerequisites
- • Proficiency in a programming language (Python, Java, or Scala)
- • Experience with databases and data modeling
- • Understanding of distributed systems and cloud computing concepts
Certificate
Certificate of Completion
Earn a certificate upon successful completion