Main Page » Advanced User Guide » Statistics
Statistics
This article lists statistics on various databases instances that have been created with BaseX, with value and full-text indexes turned off. The URLs to the original sources, if available or public, are listed below.
Databases in BaseX are light-weight. If a database limit is reached, you can distribute your documents across multiple database instances and access all of them with a single XQuery expression.Databases
Instances | FileSize | Files | DbSize | Nodes | ENames | ANames | URIs | Height |
---|---|---|---|---|---|---|---|---|
Limits | 512 GiB (2^39) | 536'870'912 (2^29) | no limit | 2'147'483'648 (2^31) | 32768 (2^15) | 32768 (2^15) | 256 (2^8) | no limit |
RuWikiHist | 421 GiB | 1 | 416 GiB | 324'848'508 | 21 | 6 | 2 | 6 |
ZhWikiHist | 126 GiB | 1 | 120 GiB | 179'199'662 | 21 | 6 | 2 | 6 |
EnWiktionary | 79 GiB | 1 | 75 GiB | 134'380'393 | 21 | 6 | 2 | 6 |
XMark | 55 GiB | 1 | 64 GiB | 1'615'071'348 | 74 | 9 | 0 | 13 |
EnWikiMeta | 54 GiB | 1 | 52 GiB | 401'456'348 | 21 | 6 | 2 | 6 |
MedLine | 38 GiB | 379 | 36 GiB | 1'623'764'254 | 84 | 6 | 0 | 9 |
iProClass | 36 GiB | 1 | 37 GiB | 1'631'218'984 | 245 | 4 | 2 | 9 |
Inex2009 | 31 GiB | 2'666'500 | 34 GiB | 1'336'110'639 | 28'034 | 451 | 1 | 37 |
CoPhIR | 29 GiB | 10'000'000 | 31 GiB | 1'104'623'376 | 42 | 42 | 0 | 8 |
EnWikipedia | 26 GiB | 1 | 25 GiB | 198'546'747 | 24 | 21 | 2 | 6 |
XMark | 22 GiB | 1 | 26 GiB | 645'997'965 | 74 | 9 | 0 | 13 |
InterPro | 14 GiB | 1 | 19 GiB | 860'304'235 | 7 | 15 | 0 | 4 |
Genome1 | 13 GiB | 1 | 13 GiB | 432'628'105 | 26 | 101 | 2 | 6 |
NewYorkTimes | 12 GiB | 1'855'659 | 13 GiB | 280'407'005 | 41 | 33 | 0 | 6 |
TrEMBL | 11 GiB | 1 | 14 GiB | 589'650'535 | 47 | 30 | 2 | 7 |
XMark | 11 GiB | 1 | 13 GiB | 323'083'409 | 74 | 9 | 0 | 13 |
IntAct | 7973 MiB | 25'624 | 6717 MiB | 297'478'392 | 64 | 22 | 2 | 14 |
Freebase | 7366 MiB | 1 | 10 GiB | 443'627'994 | 61 | 283 | 1 | 93 |
SDMX | 6356 MiB | 1 | 8028 MiB | 395'871'872 | 22 | 6 | 3 | 7 |
OpenStreetMap | 5312 MiB | 1 | 5171 MiB | 6'910'669 | 19 | 5 | 2 | 6 |
SwissProt | 4604 MiB | 1 | 5422 MiB | 241'274'406 | 70 | 39 | 2 | 7 |
EURLex | 4815 MiB | 1 | 5532 MiB | 167'328'039 | 186 | 46 | 1 | 12 |
Wikicorpus | 4492 MiB | 659'338 | 4432 MiB | 157'948'561 | 1'257 | 2'687 | 2 | 50 |
EnWikiRDF | 3679 MiB | 1 | 3537 MiB | 98'433'194 | 11 | 2 | 11 | 4 |
CoPhIR | 2695 MiB | 1'000'000 | 2882 MiB | 101'638'857 | 42 | 42 | 0 | 8 |
MeSH | 2091 MiB | 1 | 2410 MiB | 104'845'819 | 6 | 5 | 2 | 5 |
FreeDB | 1723 MiB | 1 | 2462 MiB | 102'901'519 | 7 | 3 | 0 | 4 |
XMark | 1134 MiB | 1 | 1303 MiB | 32'298'989 | 74 | 9 | 0 | 13 |
DeepFS | 810 MiB | 1 | 850 MiB | 44'821'506 | 3 | 6 | 0 | 24 |
LibraryUKN | 760 MiB | 1 | 918 MiB | 46'401'941 | 23 | 3 | 0 | 5 |
736 MiB | 1'177'495 | 767 MiB | 15'309'015 | 8 | 0 | 0 | 3 | |
Organizations | 733 MiB | 1'019'132 | 724 MiB | 33'112'392 | 38 | 9 | 0 | 7 |
DBLP | 694 MiB | 1 | 944 MiB | 36'878'181 | 35 | 6 | 0 | 7 |
Feeds | 692 MiB | 444'014 | 604 MiB | 5'933'713 | 8 | 0 | 0 | 3 |
MedLineSupp | 477 MiB | 1 | 407 MiB | 21'602'141 | 55 | 7 | 0 | 9 |
AirBase | 449 MiB | 38 | 273 MiB | 14'512'851 | 111 | 5 | 0 | 11 |
MedLineDesc | 260 MiB | 1 | 195 MiB | 10'401'847 | 66 | 8 | 0 | 9 |
ZDNET | 130 MiB | 95'663 | 133 MiB | 3'060'186 | 40 | 90 | 0 | 13 |
JMNEdict | 124 MiB | 1 | 171 MiB | 8'592'666 | 10 | 0 | 0 | 5 |
XMark | 111 MiB | 1 | 130 MiB | 3'221'926 | 74 | 9 | 0 | 13 |
Freshmeat | 105 MiB | 1 | 86 MiB | 3'832'028 | 58 | 1 | 0 | 6 |
DeepFS | 83 MiB | 1 | 93 MiB | 4'842'638 | 3 | 6 | 0 | 21 |
Treebank | 82 MiB | 1 | 92 MiB | 3'829'513 | 250 | 1 | 0 | 37 |
DBLP2 | 80 MiB | 170'843 | 102 MiB | 4'044'649 | 35 | 6 | 0 | 6 |
DDI | 76 MiB | 3 | 39 MiB | 2'070'157 | 104 | 16 | 21 | 11 |
Alfred | 75 MiB | 1 | 68 MiB | 3'784'285 | 60 | 0 | 0 | 6 |
University | 56 MiB | 6 | 66 MiB | 3'468'606 | 28 | 4 | 0 | 5 |
MediaUKN | 38 MiB | 1 | 45 MiB | 1'619'443 | 21 | 3 | 0 | 5 |
HCIBIB2 | 32 MiB | 26'390 | 33 MiB | 617'023 | 39 | 1 | 0 | 4 |
Nasa | 24 MiB | 1 | 25 MiB | 845'805 | 61 | 8 | 1 | 9 |
MovieDB | 16 MiB | 1 | 19 MiB | 868'980 | 7 | 8 | 0 | 4 |
KanjiDic2 | 13 MiB | 1 | 18 MiB | 917'833 | 27 | 10 | 0 | 6 |
XMark | 11 MiB | 1 | 13 MiB | 324'274 | 74 | 9 | 0 | 13 |
Shakespeare | 7711 KiB | 1 | 9854 KiB | 327'170 | 59 | 0 | 0 | 9 |
TreeOfLife | 5425 KiB | 1 | 7106 KiB | 363'560 | 4 | 7 | 0 | 243 |
Thesaurus | 4288 KiB | 1 | 4088 KiB | 201'798 | 33 | 9 | 0 | 7 |
MusicXML | 3155 KiB | 17 | 2942 KiB | 171'400 | 179 | 56 | 0 | 8 |
BibDBPub | 2292 KiB | 3'465 | 2359 KiB | 80'178 | 54 | 1 | 0 | 4 |
Factbook | 1743 KiB | 1 | 1560 KiB | 77'315 | 23 | 32 | 0 | 6 |
XMark | 1134 KiB | 1 | 1334 KiB | 33'056 | 74 | 9 | 0 | 13 |
Notes
- FileSize is the original size of the input documents.
- Files indicates the number of stored XML documents.
- DbSize is the size of the resulting database (excluding the value index structures).
- Nodes represents the number of XML nodes (elements, attributes, texts, etc.) stored in the database.
- Attr indicates the maximum number of attributes stored for a single element.
- ENames and No.ANames reflect the number of distinct element and attribute names.
- URIs represent the number of distinct namespace URIs.
- Height indicates the maximum level depth of the stored nodes.
Sources
⚡Generated with XQuery