Spark Microservice

The Spark Microservice consumes messages from the Typeahead Service via Kafka, processes the incoming text data in real time, and aggregates word occurrences. These aggregates are stored in HBase for fast, scalable access. This setup enables efficient handling of typeahead suggestions and analytics at scale.

Architecture Overview

Typeahead Service: Produces messages (words) to a Kafka topic. Kafka Broker: Acts as the messaging layer, delivering messages from the Typeahead Service to the Spark Microservice. Spark Microservice: Consumes messages from Kafka, processes them to aggregate word occurrences, and stores the aggregated data in HBase. HBase: A distributed, scalable NoSQL database used for storing and retrieving the aggregated word occurrences efficiently.

Features

Real-time Processing: Consumes data in real time from the Kafka topic, enabling up-to-date analysis of user input.
Scalable Aggregation: Aggregates word occurrences using Apache Spark, with results stored in HBase for efficient querying.
HBase Integration: The word frequency data is stored in HBase, providing a fast and scalable solution for reading and updating word counts.
Spark for Data Processing: Uses Apache Spark for processing large-scale data, performing operations such as filtering, grouping, and aggregation.

License

This project uses the following open-source libraries:

Spring Kafka: Licensed under the Apache License 2.0.
Apache Spark: Licensed under the Apache License 2.0.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.vscode		.vscode
spark_microservice		spark_microservice
Dockerfile		Dockerfile
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spark Microservice

Architecture Overview

Features

License

About

Releases

Packages

Languages

josephC45/spark_microservice_for_typeahead_application

Folders and files

Latest commit

History

Repository files navigation

Spark Microservice

Architecture Overview

Features

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages