LakeFusion Environment

The LakeFusion Environment is the core of the platform. It consists of two main layers, both backed by a transactional database:

  • Micro Frontends: The user interface components.

  • Microservices: The backend logic and processing components.

a. Micro Frontends

The LakeFusion Micro Frontends are built with single-spa and React, so each portal can be developed and deployed as a separate module. The portals are:

  1. Root Portal:
    Central access point for system-wide configurations, such as administrative tasks and global settings.

  2. Main Portal:
    Hosts core MDM functionalities like managing master data, deduplication, and data quality checks. It also includes tools for managing workflows and facilitating collaboration.

  3. Utility Portal:
    Provides supporting tools like user management, access logs, and troubleshooting documentation.

Each portal operates independently of the others, so a portal can be updated or deployed on its own without affecting the rest.

b. Microservices

The backend logic resides in the Microservices layer, built with FastAPI and Python. Key services include:

  1. Authentication Service:
    Manages user authentication and authorization. It integrates with Databricks SSO for secure access.

  2. Databricks Service:
    Acts as the communication bridge between LakeFusion and Databricks. This service handles:

    • Query execution on SQL Warehouses.

    • Orchestration of Databricks Workflows.

    • Access to Databricks resources such as Unity Catalog for data governance.

  3. Middle Layer Service:
    Sits between the front end and the backend, routing API calls from the micro frontends to the appropriate microservices.

  4. AI/ML Service:
    Powers advanced analytics with capabilities like:

    • Predictive analytics for identifying patterns in master data.

    • Deduplication algorithms to ensure data quality.

    • Machine learning models deployed for anomaly detection and forecasting.
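
To make the Databricks Service's query-execution role more concrete, the following is a minimal sketch of how a Python service might prepare a call to the Databricks SQL Statement Execution API (`POST /api/2.0/sql/statements/`). It is not LakeFusion's actual implementation: the function name `build_statement_request`, the host, the token, and the warehouse ID are all hypothetical placeholders, and the request is only built, never sent.

```python
import json
import urllib.request


def build_statement_request(
    host: str, token: str, warehouse_id: str, sql: str
) -> urllib.request.Request:
    """Build (but do not send) a SQL Statement Execution API request.

    Hypothetical helper for illustration; not part of LakeFusion.
    """
    payload = {
        "warehouse_id": warehouse_id,  # SQL Warehouse that runs the query
        "statement": sql,              # SQL text to execute
        "wait_timeout": "30s",         # wait up to 30 seconds for a result
    }
    return urllib.request.Request(
        url=f"https://{host}/api/2.0/sql/statements/",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Placeholder values (assumptions); substitute real workspace details.
request = build_statement_request(
    host="example.cloud.databricks.com",
    token="dapi-REDACTED",
    warehouse_id="abc123",
    sql="SELECT 1",
)
print(request.get_method(), request.full_url)
```

In a real service, the request would be sent with an HTTP client and the returned statement status checked until the query finishes. Keeping this logic in one service means the other microservices do not need Databricks credentials of their own.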

c. Transactional Database

The backend relies on the LakeFusion Transactional Database, which can be hosted on MySQL, MS SQL, or PostgreSQL. It stores and manages:

  • Transactional data generated during user interactions with the platform.

  • Metadata related to workflows, datasets, and configurations.

  • Temporary and intermediate data used during analytics and processing.
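
Because three database engines are supported, a backend service needs some way to select the right connection settings. The sketch below shows one possible approach, assuming SQLAlchemy-style connection URLs; the document does not say which database library LakeFusion uses. The function name `build_db_url`, the driver pairings, and all connection values are hypothetical.

```python
from urllib.parse import quote_plus

# Assumed dialect+driver pairings in SQLAlchemy URL form (illustrative only).
DRIVERS = {
    "mysql": "mysql+pymysql",
    "mssql": "mssql+pymssql",
    "postgresql": "postgresql+psycopg2",
}


def build_db_url(
    engine: str, user: str, password: str, host: str, port: int, database: str
) -> str:
    """Return a connection URL for one of the three supported engines."""
    if engine not in DRIVERS:
        raise ValueError(f"Unsupported engine: {engine!r}")
    # Percent-encode the credentials so characters like '@' do not break the URL.
    credentials = f"{quote_plus(user)}:{quote_plus(password)}"
    return f"{DRIVERS[engine]}://{credentials}@{host}:{port}/{database}"


# Placeholder values (assumptions) for a PostgreSQL-backed deployment.
url = build_db_url("postgresql", "lf_user", "p@ss", "db.internal", 5432, "lakefusion")
print(url)
```

Keeping the engine choice in a single configuration value lets the same service code run against any of the three databases.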
