Amundsen is a data discovery and metadata engine intended to make data analysts, data engineers, and data scientists more productive when working with data. It is built from five components: three microservices (the front-end service, the metadata service, and the search service), the Databuilder ingestion library, and a Neo4j graph database that acts as the default metadata store. Each of these components must be deployed and maintained independently.

Key Features and Components:
Data Discovery Enhancement: Data scientists often spend much of their time in the initial data discovery phase, working out what data exists, where it lives, who uses and owns it, and how to get access to it. Amundsen shortens this phase by making data sets searchable, which leaves the data team more time for deeper data exploration.

Metadata Service: The Metadata Service handles metadata requests from the front-end service and from other microservices. It uses Neo4j as its default persistence layer, though another backend can be substituted if necessary.
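
The idea of a replaceable persistence layer can be illustrated with a minimal sketch. This is not Amundsen's actual code: the names MetadataBackend, InMemoryBackend, MetadataService, and the table key format are all hypothetical, and a dictionary stands in for a real store such as Neo4j. The point is only that the service depends on an interface rather than on one database.

```python
from abc import ABC, abstractmethod
from typing import Dict, Optional


class MetadataBackend(ABC):
    """Hypothetical storage interface (an assumption, not Amundsen's real class)."""

    @abstractmethod
    def put_table_description(self, table_key: str, description: str) -> None:
        ...

    @abstractmethod
    def get_table_description(self, table_key: str) -> Optional[str]:
        ...


class InMemoryBackend(MetadataBackend):
    """Stand-in for a real store such as Neo4j; keeps everything in a dict."""

    def __init__(self) -> None:
        self._descriptions: Dict[str, str] = {}

    def put_table_description(self, table_key: str, description: str) -> None:
        self._descriptions[table_key] = description

    def get_table_description(self, table_key: str) -> Optional[str]:
        return self._descriptions.get(table_key)


class MetadataService:
    """Answers metadata requests using whichever backend it was given."""

    def __init__(self, backend: MetadataBackend) -> None:
        self._backend = backend

    def describe(self, table_key: str) -> str:
        description = self._backend.get_table_description(table_key)
        return description if description is not None else "(no description)"


backend = InMemoryBackend()
backend.put_table_description("warehouse.sales.orders", "One row per customer order.")
service = MetadataService(backend)
print(service.describe("warehouse.sales.orders"))
```

Swapping the store would then mean writing another MetadataBackend subclass and passing it to MetadataService, without touching the request-handling code.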

Search Service: Powered by Elasticsearch, the Search Service handles search queries from the front-end service. Elasticsearch is the default search engine, but it can be substituted with another search backend.
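
To give a feel for what a search request to Elasticsearch can look like, here is a small sketch that builds a query body using the standard multi_match query from the Elasticsearch query DSL. The helper build_search_body and the field names "name" and "description" are assumptions made for illustration; the queries Amundsen actually issues are more elaborate and are not shown in this article.

```python
from typing import Any, Dict, List


def build_search_body(term: str, fields: List[str], size: int = 10) -> Dict[str, Any]:
    """Build a request body that matches `term` against several text fields."""
    return {
        "size": size,  # maximum number of hits to return
        "query": {
            "multi_match": {
                "query": term,
                "fields": fields,  # hypothetical index fields
            }
        },
    }


body = build_search_body("orders", ["name", "description"])
print(body)
```

A body like this would be sent to the search endpoint of an index; which index and which fields exist depends on how the metadata was published, so treat both as placeholders.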

Front-end Service: The Front-end Service hosts the Amundsen web application, the interface through which users search and browse data and metadata.

Databuilder: Databuilder is a data ingestion framework that extracts metadata from a variety of sources so that it can be loaded into the metadata store and the search index.
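
The extract, transform, and load flow that an ingestion framework of this kind follows can be sketched in a few lines. The code below does not use the Databuilder library: TableMetadata, extract, transform, load, and the sample rows are hypothetical names and data chosen for illustration, and a plain dictionary stands in for the metadata store.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator


@dataclass(frozen=True)
class TableMetadata:
    """Hypothetical record describing one table."""
    database: str
    schema: str
    name: str
    description: str


def extract(rows: Iterable[Dict[str, str]]) -> Iterator[TableMetadata]:
    """Turn raw source rows (e.g. from a catalog query) into records."""
    for row in rows:
        yield TableMetadata(
            database=row["database"],
            schema=row["schema"],
            name=row["name"],
            description=row.get("description", ""),
        )


def transform(records: Iterable[TableMetadata]) -> Iterator[TableMetadata]:
    """Normalize records; here only surrounding whitespace is trimmed."""
    for r in records:
        yield TableMetadata(r.database, r.schema, r.name, r.description.strip())


def load(records: Iterable[TableMetadata], store: Dict[str, TableMetadata]) -> int:
    """Write records into the store, keyed by a fully qualified table name."""
    count = 0
    for r in records:
        store[f"{r.database}.{r.schema}.{r.name}"] = r
        count += 1
    return count


source_rows = [
    {"database": "warehouse", "schema": "sales", "name": "orders",
     "description": "  One row per order. "},
    {"database": "warehouse", "schema": "sales", "name": "customers"},
]
store: Dict[str, TableMetadata] = {}
loaded = load(transform(extract(source_rows)), store)
print(loaded, sorted(store))
```

Because each stage is a generator, records stream through one at a time, and supporting a new source would only require a different extract function.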

Common: The Common module holds the code shared across all Amundsen microservices, which keeps them consistent and avoids duplicated development and maintenance work.