Datasets

GS Tables
cols (min|max|x)
rows (min|max|x)
Classes Entities Pred. KG Used for validation by
T2Dv2 [112] 234
1,2K (1|30|4,52)
2,8K (1|5K|84,55)
39 - 154 DBpedia [21, 22, 30, 37, 40, 43, 44 , 48, 56, 69, 78, 84, 105, 113, 144, 148]
WebTableStitching [112] 50
300 (6|6|6)
717 (3|83|14,84)
9 400 6 DBpedia [112]
Limaye [82] 6,5K - - 747 143K 90 Wikipedia, Yago [21, 22, 30, 40, 44 , 48, 69, 84, 149, 152]
LimayeAll [149] 6,3K 28,5K 136K - 227K - Freebase -
Limaye200 [149] 200 903 4,1K 615 - 361 Freebase [149]
MusicBrainz [149] 1,4K 9,8K - - 93,3K 7K Freebase [149]
IMDB [149] 7,4K 7,4K - - 92,3K - Freebase [149]
Taheriyan [129] 29
2,5K (3|71,3K|529K)
16K (1|13,8K|937)
- - - Schema.org [129]
Tough Table (2T) [32] 180
194K (1|8|4,46)
802 (6|15,5K|108K)
540 667K 0 Wikidata, DBpedia -
MammoTab [87] 980K 5,6M 2,3M 2M 2,8M - Wikidata -
SOTAB [77] 108K - - 91 - 176 Schema.org -
Wikary [89] 81,7K 22,5K 63,9K - 30,6K 188 Wikidata -
GitTables [55] 962K 11,5M 13,6M 2,4K - - Schema.org, DBpedia -
RedTab [119] 9K
44,6K (1|11|4,86)
148K (1|353|17,09)
70 - 23 Music, Literature -
TURL [37] 484K 2,8M 7,9M - 1,2M - DBpedia [37]
BiodivTab [5] 50
1,2K (1|43|23,96)
12,9K (26|4,9K|261)
84 1,2K - Wikidata [5]
TSOTSACorpus [64] 16K - - 200 60K - Food Data -
SemTab2019 R1 64
320 (3|14|5,05)
9K (7|586|143)
120 8,4K 116 DBpedia [20, 28, 91, 97, 122, 133 ]
R2 11,9K
59,6K (1|51|5,55)
29,8K (1|1,5K|27,06)
14,8K 464K 6,7K
R3 2,1K
10,8K (4|8|4,51)
153K (6|207|71,69)
5,7K 407K 7,6K
R4 817
3,3K (4|8|4,36)
51,4K (6|198|63,73)
1,7K 107K 2,7K
SemTab2020 R1 34,3K
170K (4|8|4,96)
249K (5|16|8,7)
136K 985K 136K Wikidata [1, 11, 13, 23, 27, 30, 59, 69, 71, 99, 118, 134, 143]
R2 12,1K
55,9K (4|8|4,6)
84,9K (5|16|7,97)
438K 283K 43,8K
R3 62,6K
229K (3|7|3,66)
397K (3|16|7,34)
167K 768K 167K
R4 22,4K
79,6K (1|8|3,55)
670K (6|15,5K|30,94)
32,5K 1,7M 56,5K
SemTab2021 R1 180
802 (1|8|4,46)
194K (6|15,5K|1,08K)
539 667K 56,5K
Wikidata DBpedia
[2, 3, 9, 12, 58, 100, 121, 142 ]
R2 1,7K
5,6K (2|7|3,19)
29,3K (5|58|17,73)
2,1K 47,4K 3,8K Wikidata
R3 7,2KK
17,9K (2|5|2,48)
58,9K (5|21|9,18)
7,2K 58,9K 10,7K
SemTab2022 R1 3,8K
9,9K (2|5|2,56)
22,4K (4|8|5,69)
240 1,4K 319 Wikidata [4, 24, 13, 29, 57, 84]
R2 HT 5,1K
13,3K (2|5|2,56)
28,5K (4|8|5,57)
398 1,9K 348
R2 2T 180 802 195K
97 111
81K 177K
-
Wikipedia DBpedia
R3 Biodiv 50 1,2K 12,9K 43 1,5K -
R3 GitTables 7,6K 198K 841K
6,2K 4,4K 1K
- -
Wikidata Schema.org Schema.org
SemTab2023 R1 10,4K
26,1K (2|4|2,51)
49,1K (3|11|5,72)
- - -
Wikidata tfood Schema.org
-
R2 - - - - - -
Schema.org dbpedia
-

Datasets infographic