Resources on Semantic Table Interpretation

Welcome to the website for all things related to Semantic Table Interpretation (STI). Here, you'll find a wealth of information and resources to help you understand, implement, and innovate in the field of STI. Whether you're a researcher, developer, or enthusiast, our site offers detailed insights, cutting-edge research, and practical tools to support your work in making sense of tabular data

What's Semantic Table Interpretation?

Semantic Table Interpretation, as defined by the the SemTab challenge, involves annotating relational tables with information from a Knowledge Graph (KG). This process includes associating each column in a table with one or more KG types, known as Column Type Annotation (CTA). Additionally, Cell Entity Annotation (CEA) is applied to annotate each cell in named entity columns with a KG entity or mark it as Not In Lexicon (NIL) if it does not exist in the KG. Columns Property Annotation (CPA) involves annotating pairs of columns with a KG property. The result of this annotation process is a table enriched with semantic information

Learn more about STI

TUTSTI @ISWC2024

Discover the comprehensive world of Semantic Table Interpretation (STI) in this tutorial, which covers both theoretical and practical aspects, and trace the evolution of STI from heuristic-based methods to machine learning (ML) techniques and the latest large language model (LLM) innovations. By examining the unique characteristics, advantages, and limitations of each approach you will understand their optimal contexts of use

Our Approaches, Datasets, Tools and UIs

  1. 06/2019

    ESWC, demo

    Interface for STI approaches

  2. MantisTable

    Approach

    10/2019

    OM@ISWC, poster

    Algorithm (CTA, CPA, CEA)

  3. MantisTable

    Approach

    10/2019

    1° SemTab challenge

    Algorithm (CTA, CPA, CEA)

  4. 06/2020

    ESWC, Demo

    Gold Standard analysis

  1. MantisTable

    Approach

    11/2020

    FGCS Journal, paper

    Complete STI algorithm (Processing, CTA, CPA, CEA)

  2. 11/2020

    2° SemTab challenge

    New lookup method

  3. 11/2020

    ISWC, paper

    Gold Standard analysis

  4. 11/2020

    ISWC, paper

    Gold Standard

  1. NEST

    Approach

    09/2021

    SEMANTiCS, paper

    New STI approach with neural soft type constraints

  2. 10/2021

    3° SemTab challenge

    Complete refactoring

  3. s-elBat

    Approach

    10/2022

    4° SemTab challenge

    STI features vector algorithm based on MantisTable V

  4. MammoTab

    Dataset

    10/2022

    4° SemTab challenge

    Gold Standard

  1. 10/2022

    OM@ISWC, paper

    IR-based ER service

  2. 03/2022

    arXiv, paper

    Interface for Data Enrichment

  3. Alligator

    Approach

    10/2023

    WI-IAT, paper

    HITL STI approach based on s-elBat

Our Projects

table illustration

MammoTab

MammoTab is a unique dataset consisting of 1 million Wikipedia tables, extracted from over 20 million Wikipedia pages, and annotated using Wikidata. This dataset fills a gap in the current state-of-the-art resources, making it an excellent tool for testing and training Semantic Table Interpretation approaches. MammoTab is specifically designed to address several key challenges, including disambiguation, homonymy, and NIL-mentions, providing a comprehensive resource for advancing STI research and applications

Check out MammoTab
selbat illustration

s-elBat

s-elBat is a Semantic Table Interpretation approach designed to perform Entity Linking (EL) on tables. It uses an iterative process to link entities within tables and includes an innovative and optimised lookup approach for generating candidate entities for annotation. This method enhances the accuracy and efficiency of STI.

mantis illustration

MantisTable UI

MantisTable UI is the ultimate web interface for managing Semantic Table Interpretation (STI) approaches, providing an intuitive, ready-to-use platform tailored anyone who is interested in STI world.

steellm illustration

stEELlm

stEELlm is an innovative Semantic Table Interpretation tool that transforms data analysis. Employing the precisely tailored Mixtral 8x7B model, stEELlm excels at producing exact semantic annotations across varied datasets.

mantis illustration

LamAPI

LamAPI supports string-based retrieval but also hard and soft filters based on an input entity type (i.e., rdf:type for DBpedia and Property:P31 for Wikidata). Hard type filters remove non matching results, while soft type filters promote or demote results when an exact match is not feasible.