Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
RDF Graph Data Management in
Oracle Database and NoSQL Platforms
Xavier Lopez, Ph.D.
Senior Director, Product Management, Oracle
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
Support for Graph Data Models on All Platforms
RDF Data Model
• Data federation
• Knowledge representation
• Graph pattern analysis
Social Network
Analysis
 National Intelligence
 Public Safety
 Social Media search
 Marketing - Sentiment
Linked Data /
Semantic Mediation
Property Graph Model
• Graph Search & Analysis
• Big Data analytics
• Entity analytics
 Life Sciences
 Health Care
 Publishing
 Finance
Spatial Network
Analysis
 Logistics
 Transportation
 Utilities
 Telcoms
Network Data Model
• Network path analysis
• Multi-model modeling
Use Case Graph Model Industry Domain
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
Index
Content Mgmt BI Server Data Warehouse
Machine Generated Data
Semantic Graph model
Transaction Systems
Hadoop Appliance
Subscription Services
Human Sourced
InformationSocial Media
Event Server
Data Servers
Data Sources / Types
Access & Presentation Layer
Use Case: Linked Data, Data Federation, Integration
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
Data Sources
Contents Repository
Databases
Web resources
Blogs, Mails, news, RSS feeds
Information
Extraction
Feature Extraction,
Term Extraction
Extracted Entities &
Relationships
RDF
Intelligence Ontologies
SQL/SPARQL
Search, Presentation, Report,
Visualization, Query
National Intelligence Scenario
Enterprise Data
Spatial Documents
Person: Abduwali
Abdukhadir Muse
Nationality: Somalian
Country: UK
Group: Al Shabab
Ideology: Islamist
Person: ?
Nationality: Pakistani
Country: Pakistan
Group: ?
Person: Chehab
Abdouljamid Bouyaly
Country: Morocco
Group: al Qaeda
Currently resides
Member of
Currently resides
Member of
Supports
Supports
Link ?
Link ?
Member of
Currently resides
Has
Has
images
Use Case: Text Mining, Discovery, Big Data Integration
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
Oracle Database 12c RDF Quad Store
•RDBMS Optimizer-Aware
•Compression & partitioning
•Parallel load, inference, query
•High availability
•Label security: quad-level
•W3C compliance
• Native RDF quad store
• Manages trillions of triples
• Optimized storage architecture
• SPARQL 1.1 -Jena/Joseki
• SQL/graph query, B-tree indexing
• Ontology assisted SQL query
• RDFS, OWL2 RL, EL, SKOS
• User-defined rules
• Incremental, parallel reasoning
• User-defined inferencing
• Plug-in architecture
Load /
Storage
Query
Reasoning
• Semantic indexing framework
• Integration with
• OBIEE, Oracle R Enterprise
• Oracle Data Mining
Analytics
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
Core Inferencing Features
• Forward-chaining inference engine
• Native rulebases: RDFS, OWL 2 RL, OWL 2 EL, SKOS
• Validation of inferred data
• Proof generation
• User defined inferencing
- Temporal reasoning, Spatial reasoning
• Ladder Based Inference
- Fine grained security for inference graph
• Integration with external OWL 2 reasoners (e.g., TrOWL)
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
Rich Analysis of RDF Data
Business Intelligence Advanced Analytics
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
• World’s fastest data loading performance
• World’s fastest query performance
• Worlds fastest inference performance
• Massive scalability: 1.08 trillion edges
• Platform: Oracle Exadata X4-2 Database Machine
• Source: w3.org/wiki/LargeTripleStores, 9/26/2014
Oracle Database 12c can load, query and
inference millions of RDF graph edges
per second
0.00
0.50
1.00
1.50
2.00
Query Load Inference
1.13
1.42
1.52
Millions of triples per second
World’s Fastest Big Data Graph Benchmark
1 Trillion Triple RDF Benchmark with Oracle Spatial and Graph
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
• RDF Graph support in Oracle NoSQL
Database Enterprise Edition
• High performance Key Value store
• SPARQL 1.1 access to graph data
• Jena & Joseki SPARQL Web Services
• Massive horizontal scalability
• Support for World Wide Web Consortium
(W3C) Semantic Web standards
RDF Graph for Oracle NoSQL
Graph Support on Oracle NoSQL DB
Brings horizontal scalability to RDF graph applications
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
• High volume, simple queries (low
latency)
• Queries aggregating over most of the
graph (e.g. what are the hobbies of the
100 most popular people in the network)
• Frequent, large-scale updates
• Data Centers
RDF Graph for Oracle NoSQL
When to Consider a NoSQL Database for Graphs
Horizontal scalability, low query latency/cost, ease of install & management
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
Graph Databases: Enterprise Priorities
• Scalability: Trillions of triples
• Transactional: Concurrent loading and updates with ACID properties
• Security: OLS security labels at “triple” level (OLS).
• Standards based: W3C
• Manageable: Use existing DB tools, utilities and expertise
• Multi-type support: graph, relational, search, geospatial …
• Multi-platform: Relational database, NoSQL, Hadoop
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. Oracle Confidential – Internal/Restricted/Highly Restricted 12
Future Graph Database Work
Oracle Confidential – Internal/Restricted/Highly Restricted
• Ongoing RDF benchmarks: 1+ trillion triples
• Multi-type support: spatial, temporal …
• Property Graphs:
• Data Management: Hadoop, NoSQL, Relational RDBMS
• Analysis: Massive, parallel, distributed, in-memory graph
• Graph Query Languages
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
Graph Database Strategy
Support Graph Data Types…
• Networks ( spatial topology)
• RDF Graphs (Triple Store)
• Property Graphs
…On all enterprise platforms
• Oracle Database 12c
• Oracle NoSQL Database
• Oracle Big Data (Hadoop)
• Oracle Cloud
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
Linked Data Apps: Industry Adoption
• Life Sciences
• Finance
• Media
• Networks &
Communications
• Defense & Intelligence
• Police
Industries
Hutchinson
3G Austria
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
Oracle Database 12c RDF Quad Store
•RDBMS Optimizer-Aware
•Compression & partitioning
•Parallel load, inference, query
•High availability
•Label security: quad-level
•W3C compliance
• Native RDF quad store
• Manages trillions of triples
• Optimized storage architecture
• SPARQL 1.1 -Jena/Joseki
• SQL/graph query, B-tree indexing
• Ontology assisted SQL query
• RDFS, OWL2 RL, EL, SKOS
• User-defined rules
• Incremental, parallel reasoning
• User-defined inferencing
• Plug-in architecture
Load /
Storage
Query
Reasoning
• Semantic indexing framework
• Integration with
• OBIEE, Oracle R Enterprise
• Oracle Data Mining
Analytics
Copyright © 2014 Oracle and/or its affiliates. All rights reserved.
Support for Apache Jena Framework
Provides application developers with:
• Easy-to-use Java APIs to access Oracle databases and RDF files
• A standard-compliant SPARQL web service endpoint (Joseki, Fuseki)
• Data loading (RDF/XML, N-TRIPLES, N-QUADS, TriG ,Turtle)
• JSON output
• Oracle-specific extensions for query execution control and
management

RDF Graph Data Management in Oracle Database and NoSQL Platforms

  • 1.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. RDF Graph Data Management in Oracle Database and NoSQL Platforms Xavier Lopez, Ph.D. Senior Director, Product Management, Oracle
  • 2.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. Support for Graph Data Models on All Platforms RDF Data Model • Data federation • Knowledge representation • Graph pattern analysis Social Network Analysis  National Intelligence  Public Safety  Social Media search  Marketing - Sentiment Linked Data / Semantic Mediation Property Graph Model • Graph Search & Analysis • Big Data analytics • Entity analytics  Life Sciences  Health Care  Publishing  Finance Spatial Network Analysis  Logistics  Transportation  Utilities  Telcoms Network Data Model • Network path analysis • Multi-model modeling Use Case Graph Model Industry Domain
  • 3.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. Index Content Mgmt BI Server Data Warehouse Machine Generated Data Semantic Graph model Transaction Systems Hadoop Appliance Subscription Services Human Sourced InformationSocial Media Event Server Data Servers Data Sources / Types Access & Presentation Layer Use Case: Linked Data, Data Federation, Integration
  • 4.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. Data Sources Contents Repository Databases Web resources Blogs, Mails, news, RSS feeds Information Extraction Feature Extraction, Term Extraction Extracted Entities & Relationships RDF Intelligence Ontologies SQL/SPARQL Search, Presentation, Report, Visualization, Query National Intelligence Scenario Enterprise Data Spatial Documents Person: Abduwali Abdukhadir Muse Nationality: Somalian Country: UK Group: Al Shabab Ideology: Islamist Person: ? Nationality: Pakistani Country: Pakistan Group: ? Person: Chehab Abdouljamid Bouyaly Country: Morocco Group: al Qaeda Currently resides Member of Currently resides Member of Supports Supports Link ? Link ? Member of Currently resides Has Has images Use Case: Text Mining, Discovery, Big Data Integration
  • 5.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. Oracle Database 12c RDF Quad Store •RDBMS Optimizer-Aware •Compression & partitioning •Parallel load, inference, query •High availability •Label security: quad-level •W3C compliance • Native RDF quad store • Manages trillions of triples • Optimized storage architecture • SPARQL 1.1 -Jena/Joseki • SQL/graph query, B-tree indexing • Ontology assisted SQL query • RDFS, OWL2 RL, EL, SKOS • User-defined rules • Incremental, parallel reasoning • User-defined inferencing • Plug-in architecture Load / Storage Query Reasoning • Semantic indexing framework • Integration with • OBIEE, Oracle R Enterprise • Oracle Data Mining Analytics
  • 6.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. Core Inferencing Features • Forward-chaining inference engine • Native rulebases: RDFS, OWL 2 RL, OWL 2 EL, SKOS • Validation of inferred data • Proof generation • User defined inferencing - Temporal reasoning, Spatial reasoning • Ladder Based Inference - Fine grained security for inference graph • Integration with external OWL 2 reasoners (e.g., TrOWL)
  • 7.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. Rich Analysis of RDF Data Business Intelligence Advanced Analytics
  • 8.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. • World’s fastest data loading performance • World’s fastest query performance • Worlds fastest inference performance • Massive scalability: 1.08 trillion edges • Platform: Oracle Exadata X4-2 Database Machine • Source: w3.org/wiki/LargeTripleStores, 9/26/2014 Oracle Database 12c can load, query and inference millions of RDF graph edges per second 0.00 0.50 1.00 1.50 2.00 Query Load Inference 1.13 1.42 1.52 Millions of triples per second World’s Fastest Big Data Graph Benchmark 1 Trillion Triple RDF Benchmark with Oracle Spatial and Graph
  • 9.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. • RDF Graph support in Oracle NoSQL Database Enterprise Edition • High performance Key Value store • SPARQL 1.1 access to graph data • Jena & Joseki SPARQL Web Services • Massive horizontal scalability • Support for World Wide Web Consortium (W3C) Semantic Web standards RDF Graph for Oracle NoSQL Graph Support on Oracle NoSQL DB Brings horizontal scalability to RDF graph applications
  • 10.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. • High volume, simple queries (low latency) • Queries aggregating over most of the graph (e.g. what are the hobbies of the 100 most popular people in the network) • Frequent, large-scale updates • Data Centers RDF Graph for Oracle NoSQL When to Consider a NoSQL Database for Graphs Horizontal scalability, low query latency/cost, ease of install & management
  • 11.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. Graph Databases: Enterprise Priorities • Scalability: Trillions of triples • Transactional: Concurrent loading and updates with ACID properties • Security: OLS security labels at “triple” level (OLS). • Standards based: W3C • Manageable: Use existing DB tools, utilities and expertise • Multi-type support: graph, relational, search, geospatial … • Multi-platform: Relational database, NoSQL, Hadoop
  • 12.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. Oracle Confidential – Internal/Restricted/Highly Restricted 12 Future Graph Database Work Oracle Confidential – Internal/Restricted/Highly Restricted • Ongoing RDF benchmarks: 1+ trillion triples • Multi-type support: spatial, temporal … • Property Graphs: • Data Management: Hadoop, NoSQL, Relational RDBMS • Analysis: Massive, parallel, distributed, in-memory graph • Graph Query Languages
  • 13.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved.
  • 14.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved.
  • 15.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. Graph Database Strategy Support Graph Data Types… • Networks ( spatial topology) • RDF Graphs (Triple Store) • Property Graphs …On all enterprise platforms • Oracle Database 12c • Oracle NoSQL Database • Oracle Big Data (Hadoop) • Oracle Cloud
  • 16.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. Linked Data Apps: Industry Adoption • Life Sciences • Finance • Media • Networks & Communications • Defense & Intelligence • Police Industries Hutchinson 3G Austria
  • 17.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. Oracle Database 12c RDF Quad Store •RDBMS Optimizer-Aware •Compression & partitioning •Parallel load, inference, query •High availability •Label security: quad-level •W3C compliance • Native RDF quad store • Manages trillions of triples • Optimized storage architecture • SPARQL 1.1 -Jena/Joseki • SQL/graph query, B-tree indexing • Ontology assisted SQL query • RDFS, OWL2 RL, EL, SKOS • User-defined rules • Incremental, parallel reasoning • User-defined inferencing • Plug-in architecture Load / Storage Query Reasoning • Semantic indexing framework • Integration with • OBIEE, Oracle R Enterprise • Oracle Data Mining Analytics
  • 18.
    Copyright © 2014Oracle and/or its affiliates. All rights reserved. Support for Apache Jena Framework Provides application developers with: • Easy-to-use Java APIs to access Oracle databases and RDF files • A standard-compliant SPARQL web service endpoint (Joseki, Fuseki) • Data loading (RDF/XML, N-TRIPLES, N-QUADS, TriG ,Turtle) • JSON output • Oracle-specific extensions for query execution control and management