Entity Creation

Entity Creation

Entity configuration establishes the foundation for golden record generation by consolidating and organizing multiple data sources within a unified entity structure.

Step 1: Entity Creation

          1.Access the Entity Creation card (either from Home or from the left navigation pane)

                   

             

2. Initiate a new entity configuration with the “Create Entity” button

Input the following required parameters:

  • Entity Name 

  • Comprehensive entity description detailing its purpose and scope

Execute entity creation with the “Create” button

    2. Click on entity name 

   

Step 2: Attributes Creation

  1. Access attribute creation via the Add Attributes function


  1. Specify the following attribute parameters:

  • Attribute Name 

  • Detailed attribute description

  • Appropriate label designation

  • Corresponding data type

  • Primary key designation (if applicable)

  1. Complete attribute creation

  2. Iterate attribute creation process as needed

  3. Designate attributes for UI and Primary Key


Step 3: Map Datasets


  1. Establish correlation between created attributes and existing dataset fields

  2. Execute mapping procedure for all relevant datasets

  1. Designate primary dataset for reference


Step 4: Survivorship Rules Configuration

Survivorship rules establish field precedence during match-merge operations, determining data priority from primary and secondary sources.

  1. Go to survivorship tab

  1. Click on Create Survivorship Group

  1. Initialize new survivorship rule group by clicking on “Add Attribute”

  1. Configure required parameters:

  • Group Name 

  • Detailed description

  • Rule specifications

  1. Implement strategy rules for each attribute:

  • Select appropriate strategy (source system, aggregation, etc.)


Step 5: Validation Rules Implementation

  1. Configure validation parameters for attribute consistency

  2. Select Validate Functions Tab

  1. Click on Assign Validation Function

  1. Specify required validation elements:

  • Target attribute selection

  • Function type selection (pre-defined/user-defined)

  • Validation level(Warning/Error)

  1. Execute validation function creation

  1. Validation functions can be user defined functions too

  1. Click on the + sign to create user defined function


  1. Add the following:

  • Function Name

  • Function Type

  • Function Definition


    • Related Articles

    • Dataset Creation

      In the this configuration, the dataset’s location within Databricks is specified, and it is set up for subsequent tasks in LakeFusion. Locate and select the Datasets card on the home screen Initialize the dataset creation process with the “Create ...
    • Entity Search

      After running Match Maven, LakeFusion sends critical or uncertain matches to Entity Search for manual review and decisions. Step 1: Review Critical Entities Monitor email notifications for critical entity review requests Access Entity Search ...
    • Integration Hub

      Integration Task creation Navigate to Integration Hub post-Match Maven completion Configure new pipeline with required parameters: Task Name designation Entity selection Model specification Execute task creation Access workflow configuration via ...
    • Match Maven

      The Match Maven module enables data teams to build and evaluate match-merge models using large language models (LLMs) and embedding-based similarity techniques. It is designed for experimentation, iteration, and optimization of custom entity ...
    • Customer 360

      What is Customer 360? Customer 360 is a comprehensive data management capability within LakeFusion that delivers a unified, real-time view of all customer-related information across an organization. Powered by Databricks Lakehouse technology, ...