AWS Cloud Architecture

The diagram below shows how we implemented our data engineering, model training, and inference pipelines in the AWS cloud.

Architecture Overview

 

Data Pipeline

The data ingestion engine asynchronously receives data from hundreds of data sources and processes it in parallel through a data pipeline implemented with Apache Beam.


Model Training Pipeline

The model training pipeline comprises GPU-enabled AWS EC2 instances (p- and g-family) running the Deep Learning AMI with NVIDIA GPUs. We use AWS SageMaker with Pipe mode for robust model training.

Inference Pipeline

We use TensorFlow Serving (TFX), manually compiled with Bazel for GPU inference, and an AWS Lambda function invokes the endpoint via gRPC or REST.

Data Pipeline

apache_beam_overview.png

Data Pipeline Framework

Our data pipeline leverages the open-source framework Apache Beam.

  • Our data pipeline utilizes Apache Beam's batch processing capability to process training data in parallel within a multi-threaded context (a minimal sketch follows this list).

  • Our feature scaling component prepares data for live inference requests via gRPC/REST, or further scales feature vectors in the data pipeline for data transformation.
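
Below is a minimal Beam sketch of that batch path. The bucket paths, transform names, and the placeholder scaling logic are illustrative assumptions, not our production code.

```python
# Sketch of a Beam batch pipeline: read raw records, scale feature vectors in
# parallel, and write the scaled output. Paths are hypothetical; reading from
# s3:// requires the apache_beam[aws] extra.
import apache_beam as beam
from apache_beam.options.pipeline_options import PipelineOptions

def scale_features(record):
    # Placeholder scaling: divide each value by the row peak.
    values = [float(v) for v in record.split(",")]
    peak = max(values) or 1.0
    return ",".join(str(v / peak) for v in values)

with beam.Pipeline(options=PipelineOptions()) as pipeline:
    (pipeline
     | "ReadRaw" >> beam.io.ReadFromText("s3://our-bucket/raw/*.csv")
     | "ScaleFeatures" >> beam.Map(scale_features)
     | "WriteScaled" >> beam.io.WriteToText("s3://our-bucket/scaled/part"))
```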

apache_spark.jpg

Data Transformation

Our data transformation engine is implemented with Apache Spark. The engine spawns Spark master and worker clusters, launches a JVM to start a PySpark context, and creates an RDD from a Spark DataFrame of scaled feature vectors to transform the dataset into TensorFlow Record format.

In our benchmark, data transformation with Apache Spark showed a more than 500% performance increase over writing records with TensorFlow's TFRecordWriter alone.
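
The sketch below illustrates the DataFrame-to-TFRecord conversion described above. The S3 path, the column names ("features", "label"), and the local shard paths are assumptions; a production pipeline would write shards to shared or object storage rather than /tmp, and TensorFlow must be available on the Spark workers.

```python
# Convert a Spark DataFrame of scaled feature vectors into TFRecord shards,
# one shard per partition, written in parallel on the workers.
import tensorflow as tf
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("to-tfrecord").getOrCreate()
df = spark.read.parquet("s3://our-bucket/scaled_features/")  # hypothetical path

def partition_to_tfrecord(index, rows):
    # Each worker serializes its partition into tf.train.Example records.
    path = "/tmp/features-%05d.tfrecord" % index
    with tf.io.TFRecordWriter(path) as writer:
        for row in rows:
            example = tf.train.Example(features=tf.train.Features(feature={
                "features": tf.train.Feature(
                    float_list=tf.train.FloatList(value=row["features"])),
                "label": tf.train.Feature(
                    float_list=tf.train.FloatList(value=[row["label"]])),
            }))
            writer.write(example.SerializeToString())
    yield path

shard_paths = df.rdd.mapPartitionsWithIndex(partition_to_tfrecord).collect()
```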

dask_logo.png

Feature Extraction

Rather than relying on traditional pandas for exploratory data analysis, our feature extraction engine extracts knowledge and insights from the data using Dask.

In our benchmark, we found that the performance gain from Dask's apply heavily depends on the use of vectorized operations inside the applied function. In the optimal case, Dask delivered more than a 100x gain over pandas' DataFrame apply.
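
A hedged illustration of that principle follows: the parallel speed-up appears when the function handed to Dask does vectorized work over whole partitions rather than per-row Python calls. Column names, sizes, and the partition count are made up for the example.

```python
# Contrast a per-row pandas apply with a Dask map_partitions call whose body
# uses vectorized NumPy/pandas operations over an entire partition.
import numpy as np
import pandas as pd
import dask.dataframe as dd

pdf = pd.DataFrame({"x": np.random.rand(1_000_000),
                    "y": np.random.rand(1_000_000)})
ddf = dd.from_pandas(pdf, npartitions=8)

def extract_features(part):
    # Vectorized operations applied to the whole partition at once.
    part = part.copy()
    part["ratio"] = part["x"] / (part["y"] + 1e-9)
    part["log_x"] = np.log1p(part["x"])
    return part

# Slow baseline: per-row pandas apply with plain Python arithmetic.
baseline = pdf.apply(lambda row: row["x"] / (row["y"] + 1e-9), axis=1)

# Parallel, vectorized version distributed across Dask partitions.
features = ddf.map_partitions(extract_features).compute()
```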

homomorphic_encryption.png

Data Security - Encryption

Before the training data is sent over the wire, we apply proprietary encryption to the dataset, similar to homomorphic encryption. This reduces the exposure of the dataset in the event of a security breach.

Importantly, our encryption algorithm introduces no overhead on model training in the training pipeline, because the training algorithm requires no decryption.

Model Training Pipeline

sagemaker_pipemode.jpg

AWS SageMaker Pipe Mode

AWS SageMaker loads the model training data from Amazon S3. There is a significant performance difference between File mode and Pipe mode.

In File mode, the training data is first downloaded to an encrypted EBS volume attached to the training instance before training begins. In Pipe mode, by contrast, the data is streamed directly to the training algorithm while it runs.

Our model training pipeline uses Pipe mode through PipeModeDataset to reduce the S3 cost of transferring files between S3 and the training instance.

As shown in the benchmark graph, Pipe mode reduces job start-up time by 10x and increases I/O throughput by 2x.
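
The following is a minimal input_fn sketch that streams TFRecords from the SageMaker "training" channel with PipeModeDataset; the feature names and widths are illustrative, not our schema. The training job itself is launched with input_mode="Pipe" on the SageMaker TensorFlow estimator so the channel is streamed rather than downloaded.

```python
# Stream TFRecords from the SageMaker training channel instead of reading
# files downloaded to local disk.
import tensorflow as tf
from sagemaker_tensorflow import PipeModeDataset

def input_fn(batch_size=128):
    dataset = PipeModeDataset(channel="training", record_format="TFRecord")

    def parse(record):
        schema = {
            "features": tf.io.FixedLenFeature([128], tf.float32),  # assumed width
            "label": tf.io.FixedLenFeature([1], tf.float32),
        }
        parsed = tf.io.parse_single_example(record, schema)
        return parsed["features"], parsed["label"]

    return (dataset
            .map(parse, num_parallel_calls=4)
            .batch(batch_size)
            .prefetch(1))
```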

tf_optmized.png

TensorFlow Best Practices

Our model training pipeline’s input_fn implementation follows the best practices outlined in TensorFlow’s documentation.
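
A sketch of the tf.data patterns involved is shown below: parallel interleave over TFRecord shards, parallel map, batching, and prefetch. The file pattern, feature schema, and buffer sizes are assumptions for illustration.

```python
# File-based tf.data input pipeline following common performance guidance.
import tensorflow as tf

def parse_example(record):
    schema = {
        "features": tf.io.FixedLenFeature([128], tf.float32),  # assumed width
        "label": tf.io.FixedLenFeature([1], tf.float32),
    }
    parsed = tf.io.parse_single_example(record, schema)
    return parsed["features"], parsed["label"]

def input_fn(file_pattern, batch_size=128):
    files = tf.data.Dataset.list_files(file_pattern, shuffle=True)
    dataset = files.interleave(
        tf.data.TFRecordDataset,
        cycle_length=4,
        num_parallel_calls=tf.data.experimental.AUTOTUNE)
    dataset = dataset.shuffle(10_000)
    dataset = dataset.map(parse_example,
                          num_parallel_calls=tf.data.experimental.AUTOTUNE)
    return dataset.batch(batch_size).prefetch(tf.data.experimental.AUTOTUNE)
```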

Inference Pipeline

tf_docker.png

Dockerized Inference Pipeline

Our inference pipeline, which serves hundreds of TensorFlow Serving model artifacts, is implemented with our proprietary docker-compose/protobuf configuration manager.

The inference pipeline detects changes in model artifacts, manages configuration changes, promotes the best-performing TensorFlow Serving models, and updates the Docker containers without any downtime.
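
The configuration manager itself is proprietary, but the sketch below shows the general idea in simplified form: scan the model artifact directory and regenerate a TensorFlow Serving model config file, which a server started with a config-file polling flag (for example --model_config_file_poll_wait_seconds) can pick up without a restart. Directory layout and file names are hypothetical.

```python
# Regenerate a TensorFlow Serving model_config_list from the models on disk so
# newly promoted artifacts are served without restarting the container.
import os

CONFIG_TEMPLATE = """model_config_list {{
{entries}
}}
"""
ENTRY_TEMPLATE = """  config {{
    name: "{name}"
    base_path: "/models/{name}"
    model_platform: "tensorflow"
  }}"""

def render_model_config(models_root="/models"):
    entries = [ENTRY_TEMPLATE.format(name=name)
               for name in sorted(os.listdir(models_root))
               if os.path.isdir(os.path.join(models_root, name))]
    return CONFIG_TEMPLATE.format(entries="\n".join(entries))

if __name__ == "__main__":
    with open("/models/models.config", "w") as config_file:
        config_file.write(render_model_config())
```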

tfx.png

TensorFlow Serving (TFX)

We leverage TensorFlow Serving for our inference pipeline. The generic TensorFlow Serving (TFX) build is compiled without CPU or GPU optimizations, so to enable GPU inference we compiled the binary ourselves with Bazel and deployed it in a Docker container for machine learning model inference.
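
For illustration, a Lambda handler can reach the served model over TensorFlow Serving's REST API as sketched below; the host name, model name, and feature layout are assumptions, and the gRPC path would use the PredictionService stub instead.

```python
# Hypothetical Lambda handler forwarding an inference request to the
# TensorFlow Serving REST endpoint (/v1/models/<name>:predict).
import json
import urllib.request

TFS_URL = "http://tf-serving.internal:8501/v1/models/our_model:predict"  # assumed

def lambda_handler(event, context):
    payload = json.dumps({"instances": [event["features"]]}).encode("utf-8")
    request = urllib.request.Request(
        TFS_URL, data=payload, headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(request) as response:
        predictions = json.loads(response.read())["predictions"]
    return {"statusCode": 200, "body": json.dumps(predictions)}
```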