An Indian fintech company sought to integrate Spark structured streaming and make modifications into their pre-existing spark pipeline for real-time data access and batch process. Additionally, they aimed to implement a high speed deduplication API with fuzzy, phonetic and exact match in AWS Lambda, with data stored in DynamoDB.
A suite of AWS services including Lambda, Glue, Kinesis, DynamoDB, Redshift, StepFunctions, APIGateway, S3, and Elasticsearch were utilised.
Spark Scala was incorporated for real-time data streaming and batch processing.
Its capability to perform fuzzy, phonetic, and exact matches ensured a highly accurate and efficient data filtering process.
Discover and power up enterprises with their ‘unknown gold data mines’ and invent better and smart ways to handle, use and house data.
Try our
contactus@mitz.ai | +919656730556
Data & AI Strategic Assessment
A strategic consulting engagement to help you with a data and AI strategy, to drive and grow your business 3x times quicker.
contactus@mitz.ai | +919656730556