Some more experimental details. For the UniProtKB 2022_04 dataset
there are 17,435,087,503 quads whose predicate is rdf:type.
On disk these consume 6,411,506,834 bytes, i.e. just under 3 bits
of disk usage per quad of this kind.
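A quick sanity check on that figure (my own arithmetic, not part of the store):

```java
// Verifies the "just under 3 bits per quad" claim from the two numbers above.
public class BitsPerQuad {
    static double bitsPerQuad(long bytesOnDisk, long quads) {
        return bytesOnDisk * 8.0 / quads;
    }

    public static void main(String[] args) {
        long quads = 17_435_087_503L;      // rdf:type quads in UniProtKB 2022_04
        long bytes = 6_411_506_834L;       // on-disk size of that partition
        System.out.printf("%.2f bits per quad%n", bitsPerQuad(bytes, quads)); // ~2.94
    }
}
```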
needed here but I coded it that way).
PS. The blocker of having a 100 billion+ triple store running on my 5
On 31/10/2022 15:09, jerven Bolleman wrote:
> Dear RDF4j dev-community,
>
> I have been distracted by writing a write-once/read-many quad store :)
>
> This store is designed with some of the challenges of UniProt in mind.
> It is based around two concepts: sort all the things, and don't mix
> value types. This quad store aims to be good for datasets with up to
> about 4,000 distinct predicates, graphs in the few-hundreds range,
> billions of distinct values, and trillions of triples; datasets that
> change relatively rarely and, when they do, can be regenerated or
> reloaded from scratch.
>
> # Some technical snippets.
>
> ## Sorted lists for values
>
> The store has dictionaries for values, like the vast majority of quad
> stores. The difference is one dictionary per distinct datatype, plus
> one for IRIs. A nuance of these dictionaries is that they are based
> around sorted lists, compressed and memory mapped, and all keys are
> therefore just index positions. These keys are valid for comparison
> operators: e.g. with key 1 for value "a" and key 2 for value "b", key
> comparison (Long.compare) matches SPARQL value comparison.
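A minimal sketch of that idea (class and method names are mine, not the store's): values of one datatype sit in a sorted array, a key is simply the position in that array, and comparing keys therefore agrees with comparing the values themselves.

```java
import java.util.Arrays;

// Sketch of a per-datatype dictionary backed by a sorted list.
// Keys are positions in the sorted array, so key order == value order.
public class SortedDictionary {
    private final String[] sorted;

    public SortedDictionary(String[] values) {
        this.sorted = values.clone();
        Arrays.sort(this.sorted);            // establish the order once
    }

    public long keyOf(String value) {        // key = index in the sorted list
        return Arrays.binarySearch(sorted, value);
    }

    public String valueOf(long key) {
        return sorted[(int) key];
    }

    public static void main(String[] args) {
        SortedDictionary d = new SortedDictionary(new String[] {"b", "a", "c"});
        long ka = d.keyOf("a"), kb = d.keyOf("b");
        // Long.compare on keys matches string comparison on the values
        System.out.println(Long.compare(ka, kb) < 0); // true, since "a" < "b"
    }
}
```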
>
> ## Partitioned triple tables, with graph filters
>
> The quad table, however, is highly partitioned: there is one table per
> combination of
> * whether the subject is a bnode or an IRI,
> * the predicate,
> * whether the object is a bnode, an IRI, or a specific datatype.
>
> e.g.
>
> _:1 :pred_0 <http://example.org/iri> .
> <http://example.org/iri> :pred_0 3 .
> <http://example.org/iri> :pred_0 "lala" .
>
> will be stored in 3 distinct tables. This allows us to completely
> avoid storing the predicate and the kind of subject or object. For now
> the tables are stored in separate files, e.g.
>
> ./pred_0/bnode/iris
> ./pred_0/iri/datatype_xsd_int
> ./pred_0/iri/datatype_xsd_string
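The partitioning rule can be sketched as a small path function (my own naming; the real store's layout code surely differs):

```java
// Sketch: derive the table path for a triple from the subject kind,
// the predicate, and the object kind or datatype.
public class PartitionPath {
    enum SubjectKind { BNODE, IRI }

    static String pathFor(SubjectKind subject, String predicate, String objectKind) {
        return "./" + predicate + "/" + subject.name().toLowerCase() + "/" + objectKind;
    }

    public static void main(String[] args) {
        // _:1 :pred_0 <http://example.org/iri> .
        System.out.println(pathFor(PartitionPath.SubjectKind.BNODE, "pred_0", "iris"));
        // <http://example.org/iri> :pred_0 3 .
        System.out.println(pathFor(PartitionPath.SubjectKind.IRI, "pred_0", "datatype_xsd_int"));
    }
}
```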
>
> Which graphs a triple is in is encoded in a bitset (roaring, for
> compression), and there may be multiple graph bitsets per table.
> All graphs must be identified by an IRI.
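Graph membership per table row can be sketched with a bitset (the store uses RoaringBitmap for compression; `java.util.BitSet` keeps this self-contained):

```java
import java.util.BitSet;

// Sketch: one bitset per graph, where bit i means "row i of this triple
// table is in that graph". The store uses RoaringBitmap instead.
public class GraphMembership {
    public static void main(String[] args) {
        // hypothetical graph :g1 contains rows 0, 2 and 3 of one table
        BitSet inGraphG1 = new BitSet();
        inGraphG1.set(0);
        inGraphG1.set(2);
        inGraphG1.set(3);

        System.out.println(inGraphG1.get(2));        // true: row 2 is in :g1
        System.out.println(inGraphG1.get(1));        // false
        System.out.println(inGraphG1.cardinality()); // 3 triples in :g1
    }
}
```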
>
> ## Inverted indexes using bitsets
> Many values can be stored completely inline in such a representation,
> and we also invert the table, which is very valuable when there is a
> small set of distinct objects, e.g. for a predicate with boolean
> values.
>
> We do
> true -> [:iri1, :iri2, :iri4]
> false -> [:iri1, :iri4, :iri7]
>
> instead of
> :iri1 true
> :iri1 false
> :iri2 true
> :iri4 true
> :iri4 false
> :iri7 false
>
> As all IRIs' string values are addressable by a 63-bit long value
> (positive only), we can turn this into two bitsets, which gives very
> large compression ratios and speed afterwards. Reduction to 2% of the
> input data is possible for quite a large number of datasets. (2/3rds of
> the predicate-value combinations in UniProtKB are compressible this way.)
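The boolean example above can be sketched like this (small ints stand in for the 63-bit subject keys, and `java.util.BitSet` stands in for roaring):

```java
import java.util.BitSet;

// Sketch: invert object -> subjects for a boolean-valued predicate.
// Subjects are assumed already mapped to dictionary keys.
public class InvertedBooleanIndex {
    public static void main(String[] args) {
        BitSet subjectsTrue = new BitSet();
        BitSet subjectsFalse = new BitSet();

        // :iri1 true, :iri1 false, :iri2 true, :iri4 true, :iri4 false, :iri7 false
        subjectsTrue.set(1);  subjectsFalse.set(1);
        subjectsTrue.set(2);
        subjectsTrue.set(4);  subjectsFalse.set(4);
        subjectsFalse.set(7);

        // ?s :pred true -> just enumerate the "true" bitset
        System.out.println(subjectsTrue);   // {1, 2, 4}

        // subjects with both values: a bitwise AND instead of a join
        BitSet both = (BitSet) subjectsTrue.clone();
        both.and(subjectsFalse);
        System.out.println(both);           // {1, 4}
    }
}
```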
>
> ## Join optimization candidates
>
> Considering all triples are stored in subject, object order (or that
> order is cheap to generate), we can also do a MergeJoin by default for
> all patterns that join on a subject variable. BitSet joins might
> in some cases also be possible.
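A merge join over two subject-sorted inputs is a single linear pass; a minimal sketch (my own simplification, ignoring the object columns):

```java
import java.util.ArrayList;
import java.util.List;

// Sketch: merge join of two triple patterns on a shared subject variable.
// Both inputs are sorted by subject key, so one linear pass suffices.
public class SubjectMergeJoin {
    static List<Long> mergeJoin(long[] a, long[] b) {
        List<Long> out = new ArrayList<>();
        int i = 0, j = 0;
        while (i < a.length && j < b.length) {
            int cmp = Long.compare(a[i], b[j]);
            if (cmp == 0) { out.add(a[i]); i++; j++; } // shared subject key
            else if (cmp < 0) i++;                     // advance the smaller side
            else j++;
        }
        return out;
    }

    public static void main(String[] args) {
        long[] pattern1 = {1, 2, 4, 9};   // subject keys matching ?s :p1 ?o
        long[] pattern2 = {2, 3, 4, 10};  // subject keys matching ?s :p2 ?o
        System.out.println(mergeJoin(pattern1, pattern2)); // [2, 4]
    }
}
```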
>
> ## Open work
>
> There is still a lot of work to be done to make it as fast as possible
> and to validate that it really works as it is supposed to.
> * Strings using fewer than nine UTF-8 characters are also inline-value
> candidates, but this is not wired up yet.
> * FSST compression for the IRI dictionary instead of LZ4.
> * Cleanup experiments
> * Document more :(
> * Reduce temporary file size requirements during compression stage (7TB
> for UniProtKB)
>
>
> ## Early results
>
> Early results are encouraging. For the UniProtKB release we need 610 GB
> of disk space: 197 GB for the "quads" and the other 413 GB for the
> values, i.e. roughly 16 bits per triple! This is better than the raw
> RDF/XML compressed with xz --best :)
>
> Loading time (for UniProtKB 2022_04) is currently 59 hours on a 128 core
> machine (first generation EPYC). With 24 hours in preparsing the rdf/xml
> and merge sorting the triples. Another 10 hours in sorting all IRIs, and
> 25 for converting all values in the triple tables down into their long
> identifiers.
>
> In principle the first and last steps are highly parallelizable, and
> the last step might be much faster when moving from LZ4 to FSST[1]
> compression for IRIs and long strings.
>
> I have an in-principle agreement that I am allowed to contribute this
> to RDF4j, but would like to poll whether there is a desire for this and
> what kind of paperwork I need to supply.
>
> Considering it is a larger-than-normal contribution for me, I won't
> make the code available until I am clear that the paperwork will be
> fine, or that making it fine requires it to be open somewhere already.
>
> Regards,
> Jerven
>
>
> [1]
https://github.com/cwida/fsst/
--
*Jerven Tjalling Bolleman*
Principal Software Developer
*SIB | Swiss Institute of Bioinformatics*
1, rue Michel Servet - CH 1211 Geneva 4 - Switzerland
t +41 22 379 58 85
Jerven.Bolleman@sib.swiss - www.sib.swiss
_______________________________________________
rdf4j-dev mailing list
rdf4j-dev@xxxxxxxxxxx
To unsubscribe from this list, visit
https://www.eclipse.org/mailman/listinfo/rdf4j-dev