Have you purchased yours? Buy Your All-Access Pass now & get lifetime access!

Prashanth Rao

Cypher vs. SQL: When do you need a graph database?

A Talk by Prashanth Rao (AI Engineer, Kùzu Inc.)

Proudly supported by

About this Talk

Developers who work with data most likely have experience in some dialect of SQL, and are familiar with the relational data model. However, there are certain scenarios in which a graph data model can be beneficial.

In this masterclass, we will introduce the property graph data model and go over the key features of Cypher, a declarative query language for graphs. The goal is to demonstrate certain classes of queries whose logic is easier to express (and likely to run faster) when using a graph database that implements Cypher. Developers coming from relational databases and SQL will find Cypher to be a refreshingly familiar graph analogue to SQL.

To demonstrate these concepts, we transform an existing tabular dataset into a graph and ingest it into Kùzu, an open source, embedded graph database management system that implements Cypher. We will perform a comparison between SQL and Cypher queries that answer the same questions on the same data, highlighting both the similarities and the differences between them. As the class progresses, we will also demonstrate some methods to use LLMs to generate Cypher from user questions asked in natural language.

By the end of the masterclass, attendees will come away having a greater appreciation for data modelling and query workloads that could benefit from graphs, how to answer questions using Cypher, as well as the skills to construct and query their own knowledge graphs from a variety of sources that can power modern search/retrieval systems or RAG applications.

Key Topics

  • Property graph data model and the Cypher query language
  • Graph queries and traversals using Cypher
  • Flexible joins, recursive queries and pattern-matching queries in Cypher
  • Transforming tabular data into graphs
  • ETL and data transformation workflows for graphs
  • Using LLMs to query graphs

Target Audience

  • Data Scientists
  • Data Engineers
  • Machine Learning Engineers
  • Data Analysts
  • Managers of the above

Goals

Get hands-on experience using Cypher and interacting with a graph database to answer questions about your data.

Session outline:

  • Introduction to the property graph data model
  • Characteristics of Cypher
  • Examples of how some queries are hard to answer in SQL, but easy and intuitive in Cypher
  • Recursive querying and the notion of paths
  • Flexible join sequences
  • Query patterns in the data
  • Visualising graphs to gain insights
  • Practical demo: Running Cypher queries in Kùzu

Format

This class is hands-on.

The beginning of the class will start in a lecture format, but will quickly move to hands-on coding exercises. A combination of command line tools, Python scripts and Docker containers will be used during the session.

We will be working with Jupyter notebooks or Python scripts run from the command line (participants’ choice) with standard Python packages installed via pip. To visualise the graph, we use Docker to launch a visualisation tool called Kùzu Explorer, so it’s useful to have an installation of Docker on your machine.

Level

Beginner - Intermediate

Prerequisite Knowledge

Basic knowledge of Python and the command line (and an even more basic knowledge of Docker) is beneficial.

11 December 2024, 11:15 AM

Mix and Match Stage

11:15 AM - 01:15 PM

Add to Calendar

About The Speakers

Prashanth Rao

Prashanth Rao

AI Engineer, Kùzu Inc.

Prashanth Rao is an AI engineer at Kùzu Inc. Over the last several years, he's worked with numerous data systems, including graph DBs, with a focus on data engineering, analytics and machine learning to power a variety of applications.

Prashanth Rao

Location

Convene 133 Houndsditch

133 Houndsditch, London

Neo4j

Neo4j, the Graph Database & Analytics leader, helps organizations find hidden relationships and patterns across billions of data connections deeply, easily, and quickly.

Platinum Sponsor

Ontotext

Connect the dots of your data! Ontotext helps enterprises to lower data management costs by up to 30%, enable data fabric architectures, create digital twins, utilize Graph RAG benefits, and take information delivery from days to minutes!

Gold Sponsor

Semantic Web Company / PoolParty

The vendor of PoolParty Semantic Suite. Graph-based text mining, recommender systems, and data fabric solutions.

Gold Sponsor

yWorks

yWorks specializes in the development of professional software solutions that enable the clear visualization of diagrams and networks.

Gold Sponsor

Oracle

This global cloud tech company offers infrastructure and software to enhance organizational innovation and efficiency. Its autonomous database delivers top performance, security, and cost savings. Supporting businesses, governments, and research, the company embraces diversity and fosters growth.

Gold Sponsor

Ultipa

Ultipa builds next-gen graph XAI & real-time database empowering smart enterprises w/ smooth digital transformations.

Sliver Sponsor

Oxford Semantic Technologies

Oxford Semantic Technologies (OST) spun out from the University of Oxford and was acquired by Samsung in 2024. OST provides AI software to extract insights from big data, solving issues like medical diagnostics and financial crime. One founder is a BCS Lovelace Medal winner.

Sliver Sponsor

FlureeDB

Web3 data platform built on standards. Fluree powers connected, secure, and agile data ecosystems.

Bronze Sponsor

Senzing

Senzing is the first to deliver real-time, artificial intelligence for entity resolution. Senzing software enables organizations of all sizes to gain highly accurate and valuable insights about who is who and who is related to whom in data.

Bronze Sponsor

Semantic Partners

We partner with you, and your chosen semantic stack, to liberate your data's meaning from isolated silos.

Bronze Sponsor

Epsilla

All-in-one platform to create AI agents powered by your private data and knowledge. Make GenAI prototype to production 10 times faster. We are backed by Y Combinator. Start free today: https://epsilla.com

Bronze Sponsor

Neural Alpha

Since 2016 Neural Alpha have delivered cutting edge, sustainability centric Connected Data solutions for blue-chip corporates, financial institutions, Governments and NGOs. Our bespoke software & data solutions fuse AI, Knowledge Graphs, Taxonomies & other technologies for unprecedented insights.

Sliver Sponsor

Want to sponsor this event? Contact Us