Using FROM and FROM NAMED on ConjunctiveGraph behaves not standard conform #811

white-gecko · 2018-02-22T14:43:24Z

I want to use FROM and FROM NAMED in a SPARQL query to select the default graph resp. named graphs to execute the query on. But the RDFlib implementation does not act as it is described in the SPARQL 1.1 specification especially the section "13.2 Specifying RDF Datasets" (https://www.w3.org/TR/2013/REC-sparql11-query-20130321/#specifyingDataset)

To demonstrate this behavior, I've created an MWE:

#!/usr/bin/env python3

import rdflib.plugins.sparql
from rdflib import ConjunctiveGraph
 
data = """
<urn:a> <urn:a> <urn:a> <urn:a> .
<urn:b> <urn:b> <urn:b> <urn:b> .
<urn:c> <urn:c> <urn:c> <urn:c> .
"""

if __name__ == "__main__":
    rdflib.plugins.sparql.SPARQL_DEFAULT_GRAPH_UNION = False  # Line A
    rdflib.plugins.sparql.SPARQL_LOAD_GRAPHS = False
    graph = ConjunctiveGraph()
    graph.parse(data=data, format='nquads')
    result = graph.query("SELECT * {?s ?p ?o}")               # Line B
    for resrow in result:
        print(resrow)

Running this code behaves as expected, while I will show derived examples in the following, by altering Line A and Line B:

1. Replacing the Default Graph

rdflib.plugins.sparql.SPARQL_DEFAULT_GRAPH_UNION = True       # Line A
result = graph.query("SELECT * FROM <urn:b> {?s ?p ?o}")      # Line B

What I see: returns all three statements.
What I expect: Only the statement <urn:b> <urn:b> <urn:b> as result
Why is the actual result not correct?:

A SPARQL query may specify the dataset to be used for matching by using the FROM clause and the FROM NAMED clause to describe the RDF dataset. If a query provides such a dataset description, then it is used in place of any dataset that the query service would use if no dataset description is provided in a query.
(https://www.w3.org/TR/2013/REC-sparql11-query-20130321/#specifyingDataset)

2. Specifying a Named Graph but Querying the Default Gaph

rdflib.plugins.sparql.SPARQL_DEFAULT_GRAPH_UNION = True         # Line A
result = graph.query("SELECT * FROM NAMED <urn:b> { ?s ?p ?o}") # Line B

What I see: returns all three statements.
What I expect: no result
Why is the actual result not correct?:

If there is no FROM clause, but there is one or more FROM NAMED clauses, then the dataset includes an empty graph for the default graph.
(https://www.w3.org/TR/2013/REC-sparql11-query-20130321/#specifyingDataset)

3. Specifying a Named Graph

rdflib.plugins.sparql.SPARQL_DEFAULT_GRAPH_UNION = False                   # Line A
result = graph.query("SELECT * FROM NAMED <urn:b> { GRAPH ?g {?s ?p ?o}}") # Line B

What I see: returns all three statements.
What I expect: Only the statement <urn:b> <urn:b> <urn:b> <urn:b> as result
Why is the actual result not correct?: Because this is the idea of NAMED GRAPH to specify a named graph to query.

A query can supply IRIs for the named graphs in the RDF Dataset using the FROM NAMED clause. Each IRI is used to provide one named graph in the RDF Dataset.
(https://www.w3.org/TR/2013/REC-sparql11-query-20130321/#specifyingDataset)

Because I think all of these three cases are related to each other I've put them into one issue, but sure they could also be split into three issues.

The text was updated successfully, but these errors were encountered:

ghost · 2022-01-01T15:50:21Z

For a while, I thought this might be related to the identifier-as-context changes but now I realise that it's unrelated. fwiw, test included:

import rdflib.plugins.sparql
from rdflib import ConjunctiveGraph


def test_issue811():

    data = """
    <urn:a> <urn:a> <urn:a> <urn:a> .
    <urn:b> <urn:b> <urn:b> <urn:b> .
    <urn:c> <urn:c> <urn:c> <urn:c> .
    """

    rdflib.plugins.sparql.SPARQL_DEFAULT_GRAPH_UNION = False
    rdflib.plugins.sparql.SPARQL_LOAD_GRAPHS = False

    graph = ConjunctiveGraph()
    graph.parse(data=data, format="nquads")
    assert len(graph) == 3

    assert len(graph.query("SELECT * {?s ?p ?o .}")) == 0

    # Set default graph as UNION, CORRECT result
    rdflib.plugins.sparql.SPARQL_DEFAULT_GRAPH_UNION = True
    assert len(graph.query("SELECT * {?s ?p ?o .}")) == 3

    # Use FROM to specify <urn:b> as the default graph

    # Set default graph as UNION, INCORRECT result
    rdflib.plugins.sparql.SPARQL_DEFAULT_GRAPH_UNION = True
    assert (
        len(graph.query("SELECT * FROM <urn:b> {?s ?p ?o}")) == 3
    ), "should be 1 triple"

    # Set default graph as NON-UNION, CORRECT result
    rdflib.plugins.sparql.SPARQL_DEFAULT_GRAPH_UNION = False
    assert (
        len(graph.query("SELECT * FROM <urn:b> {?s ?p ?o}")) == 1
    ), "should be 1 triple"

    # Use FROM NAMED to specify <urn:b> as target graph

    # Set default graph as UNION, INCORRECT result
    rdflib.plugins.sparql.SPARQL_DEFAULT_GRAPH_UNION = True
    assert (
        len(graph.query("SELECT * FROM NAMED <urn:b> {?s ?p ?o}")) == 3
    ), "should be 1 triple"

    # Set default graph as NON-UNION, CORRECT result
    rdflib.plugins.sparql.SPARQL_DEFAULT_GRAPH_UNION = False
    assert (
        len(graph.query("SELECT * FROM NAMED <urn:b> {?s ?p ?o}")) == 1
    ), "should be 1 triple"

aucampia · 2022-01-01T15:55:55Z

will add with xfail if I have time

aucampia · 2022-04-17T21:01:36Z

See also:

Remove bulit-in graph loading for FROM and FROM NAMED in SPARQL evaluation #1541

apicouSP · 2024-05-28T15:05:34Z

Since the issue still doesn't have a fix, I made one:
For the moment, just for the queries, it would still be necessary to manage update USING/USING NAMED clauses.
What I implemented:
Include only the graphs in FROM clause in the query's default graph
Include only the graphs in the FROM NAMED clause in the query's named graphs

And also:
Try to load external graphs only if they don't already exist in the given ConjunctiveGraph

In my understanding of the w3c spec, if we define a FROM clause, the query's RDF dataset is considered explicit, and if there is no FROM NAMED clause, then named graph is considered empty set. And vice versa. That is what I implemented.

white-gecko mentioned this issue Feb 22, 2018

Support FROM queries. Fix #116. AKSW/QuitStore#117

Merged

white-gecko mentioned this issue Oct 28, 2019

[question] Update appears to succeed, but no data on select AKSW/QuitStore#252

Closed

white-gecko mentioned this issue Sep 27, 2021

Remove bulit-in graph loading for FROM and FROM NAMED in SPARQL evaluation #1421

Closed

ghost added the id-as-cntxt tracking related issues label Dec 24, 2021

ghost added SPARQL and removed id-as-cntxt tracking related issues labels Jan 1, 2022

This was referenced May 28, 2024

Bug when using FROM statements #2670

Closed

Fix explicit dataset (FROM and FROM NAMED clauses) #2794

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Using FROM and FROM NAMED on ConjunctiveGraph behaves not standard conform #811

Using FROM and FROM NAMED on ConjunctiveGraph behaves not standard conform #811

white-gecko commented Feb 22, 2018

ghost commented Jan 1, 2022

Uh oh!

aucampia commented Jan 1, 2022

Uh oh!

aucampia commented Apr 17, 2022

Uh oh!

apicouSP commented May 28, 2024

Uh oh!

Using FROM and FROM NAMED on ConjunctiveGraph behaves not standard conform #811

Using FROM and FROM NAMED on ConjunctiveGraph behaves not standard conform #811

Comments

white-gecko commented Feb 22, 2018

ghost commented Jan 1, 2022

Uh oh!

aucampia commented Jan 1, 2022

Uh oh!

aucampia commented Apr 17, 2022

Uh oh!

apicouSP commented May 28, 2024

Uh oh!