Working with joins in LookML

Joins let you connect different views so that you can explore data from more than one view at the same time and see how different parts of your data relate to each other.

For example, your database might include the tables order_items, orders, and users. You can use joins to explore data from all tables at the same time. This page explains joins in LookML, including specific join parameters and joining patterns.

Joins start with an Explore

Joins are defined in the model file to establish the relationship between an Explore and a view. Joins connect one or more views to a single Explore, either directly, or through another joined view.

Let's consider two database tables: order_items and orders. After you generate views for both tables, declare one or more of them under the explore parameter in the model file:

explore: order_items { ... }

When you run a query from the order_items Explore, order_items appears in the FROM clause of the generated SQL:

SELECT ...
FROM order_items

You can join additional information to the order_items Explore. For example, you can use the following sample LookML to join the orders view to the order_items Explore:

explore: order_items {
  join: orders {
    type: left_outer
    relationship: many_to_one
    sql_on: ${order_items.order_id} = ${orders.id} ;;
  }
}

The LookML above accomplishes two things. First, you can see fields from both orders and order_items in the UI:

The Order Items Explore includes the fields from the Order Items view and the fields from the joined Orders view.

Second, the LookML describes how to join orders and order_items together. That LookML would translate to the following SQL:

SELECT ...
FROM order_items
LEFT JOIN orders
ON order_items.order_id = orders.id

These LookML parameters are described in greater detail in the following sections. See the join parameter reference page to learn more about how this LookML is translated into SQL.

Chat Team Tip: Users ask most about the validation error, "Unknown or inaccessible field," which can be caused by a missing join. See the Best Practices page about this error for more information.

Join parameters

Four main parameters are used to join: join, type, relationship, and sql_on.

Step 1: Starting the Explore

First, create the order_items Explore:

explore: order_items { ... }

Step 2: join

To join a table, you must first declare it in a view. In this example, orders is an existing view in your model.

Then, use the join parameter to declare that you want to join the orders view to order_items:

explore: order_items {
  join: orders { ... }
}

Step 3: type

Consider which type of join to perform. Looker supports LEFT JOIN, INNER JOIN, FULL OUTER JOIN, and CROSS JOIN. These correspond to the type parameter values of left_outer, inner, full_outer, and cross.

explore: order_items {
  join: orders {
    type: left_outer
  }
}

The default value of type is left_outer, and generally the popular join type.

Step 4: relationship

Define a join relationship between order_items and orders. Properly declaring the relationship of a join is important for Looker to calculate accurate measures. The relationship is defined from the order_items Explore to the orders view. The possible options are one_to_one, many_to_one, one_to_many, and many_to_many.

In this example, there can be many order_items for a single order. The relationship from order_items to orders is many_to_one:

explore: order_items {
  join: orders {
    type: left_outer
    relationship: many_to_one
  }
}

If you do not include a relationship in your join, Looker defaults to many_to_one.

For additional tips on defining the relationship parameter correctly for a join, see the Getting the relationship parameter right Best Practices page.

Step 5: sql_on

Declare how to join these two tables together with either the sql_on or the foreign_key parameter. We usually suggest sql_on since it can do everything foreign_key can do, but is typically easier to understand.

sql_on is equivalent to the ON clause in the generated SQL for a query. With this parameter, we can declare which fields should be matched up to perform the join:

explore: order_items {
  join: orders {
    type: left_outer
    relationship: many_to_one
    sql_on: ${order_items.order_id} = ${orders.id} ;;
  }
}

You can also write more complex joins. For example, you may want to join only orders with id greater than 1000:

explore: order_items {
  join: orders {
    type: left_outer
    relationship: many_to_one
    sql_on: ${order_items.order_id} = ${orders.id} AND ${orders.id} > 1000 ;;
  }
}

Check out substitution operators to learn more about the ${ ... } syntax in these examples.

Step 6: Testing

Test that this join is functioning as expected by going to the Order Items Explore. You should see fields from both order_items and orders.

See Model Development to learn more about testing LookML changes.

Joining through another view

You can join a view to an Explore through another view. In the example above, you joined orders to order_items via the order_id field. We might also want to join the data from a view called users to the order_items Explore, even though they don't share a common field. This can be done by joining through the orders view.

Use sql_on or foreign_key to join users to orders instead of order_items. Do this by correctly scoping the field from orders as orders.user_id.

Here is an example using sql_on:

explore: order_items {
  join: orders {
    type: left_outer
    relationship: many_to_one
    sql_on: ${order_items.order_id} = ${orders.id} ;;
  }
  join: users {
    type: left_outer
    relationship: many_to_one
    sql_on: ${orders.user_id} = ${users.id} ;;
  }
}

Joining a view more than once

A users view contains data for both buyers and sellers. To join data from this view into order_items, but do so separately for buyers and sellers, you can join users twice, with different names, using the from parameter.

The from parameter lets you specify which view to use in a join, while giving the join a unique name. For example:

explore: order_items {
  join: orders {
    type: left_outer
    relationship: many_to_one
    sql_on: ${order_items.order_id} = ${orders.id} ;;
  }
  join: buyers {
    from: users
    type: left_outer
    relationship: many_to_one
    sql_on: ${orders.buyer_id} = ${buyers.id} ;;
  }
  join: sellers {
    from: users
    type: left_outer
    relationship: many_to_one
    sql_on: ${orders.seller_id} = ${sellers.id} ;;
  }
}

In this case, only buyer data is joined as buyers, while only seller data is joined in as sellers.

Note: The users view must now be referred to by its aliased names buyers and sellers in the join.

Limiting fields from a join

The fields parameter lets you specify which fields are brought from a join into an Explore. By default, all fields from a view are brought in when joined. However, you might want to bring through only a subset of fields.

For example, when orders is joined to order_items, you may want to bring only the shipping and tax fields through the join:

explore: order_items {
  join: orders {
    type: left_outer
    relationship: many_to_one
    sql_on: ${order_items.order_id} = ${orders.id} ;;
    fields: [shipping, tax]
  }
}

You can also reference a set of fields, such as [set_a*]. Each set is defined within a view using the set parameter. Suppose you have the following set defined in the orders view:

set: orders_set {
  fields: [created_date, shipping, tax]
}

You can choose to bring only these three fields through when you join orders to order_items:

explore: order_items {
  join: orders {
    type: left_outer
    relationship: many_to_one
    sql_on: ${order_items.order_id} = ${orders.id} ;;
    fields: [orders_set*]
  }
}

Symmetric aggregates

Looker uses a feature called "symmetric aggregates" to calculate aggregations (like sums and averages) correctly, even when joins result in a fanout. Symmetric aggregates are described in more detail in Understanding symmetric aggregates Best Practices page, and the fanout problem they solve is explained in The problem of SQL fanouts Community post.

Primary keys required

To have measures (aggregations) come through joins, you must define primary keys in all views that are involved in the join.

Do this by adding the primary_key parameter to the primary key field definition in each view:

dimension: id {
  type: number
  primary_key: yes
}

To correctly handle joined measures, Looker relies on you specifying a primary key where the values are completely unique, non-NULL values. If your data does not contain a primary key, consider whether the concatenation of several fields would result in a primary key of completely unique, non-NULL values. If your primary key is not unique or contains NULL values and your query includes data that reveal those issues, then Looker returns an error as described on the Error: Non-Unique value/primary key (or sql_distinct_key), value overflow or collision when computing sum Best Practices page.

Supported SQL dialects

For Looker to support symmetric aggregates in your Looker project, your database dialect must also support them. The following table shows which dialects support symmetric aggregates in the latest release of Looker:

Dialect Supported?
Actian Avalanche
Yes
Amazon Athena
Yes
Amazon Aurora MySQL
Yes
Amazon Redshift
Yes
Apache Druid
No
Apache Druid 0.13+
No
Apache Druid 0.18+
No
Apache Hive 2.3+
No
Apache Hive 3.1.2+
No
Apache Spark 3+
Yes
ClickHouse
No
Cloudera Impala 3.1+
Yes
Cloudera Impala 3.1+ with Native Driver
Yes
Cloudera Impala with Native Driver
No
DataVirtuality
Yes
Databricks
Yes
Denodo 7
Yes
Denodo 8
Yes
Dremio
No
Dremio 11+
Yes
Exasol
Yes
Firebolt
Yes
Google BigQuery Legacy SQL
Yes
Google BigQuery Standard SQL
Yes
Google Cloud PostgreSQL
Yes
Google Cloud SQL
Yes
Google Spanner
Yes
Greenplum
Yes
HyperSQL
No
IBM Netezza
Yes
MariaDB
Yes
Microsoft Azure PostgreSQL
Yes
Microsoft Azure SQL Database
Yes
Microsoft Azure Synapse Analytics
Yes
Microsoft SQL Server 2008+
Yes
Microsoft SQL Server 2012+
Yes
Microsoft SQL Server 2016
Yes
Microsoft SQL Server 2017+
Yes
MongoBI
No
MySQL
Yes
MySQL 8.0.12+
Yes
Oracle
Yes
Oracle ADWC
Yes
PostgreSQL 9.5+
Yes
PostgreSQL pre-9.5
Yes
PrestoDB
Yes
PrestoSQL
Yes
SAP HANA 2+
Yes
SingleStore
Yes
SingleStore 7+
Yes
Snowflake
Yes
Teradata
Yes
Trino
Yes
Vector
Yes
Vertica
Yes

If your dialect does not support symmetric aggregates, be careful when executing joins in Looker, as some types of joins can result in inaccurate aggregations (like sums and averages). This problem and the workarounds for it are described in great detail in The problem of SQL fanouts Community post.

Learn more about joins

To learn more about join parameters in LookML, see the Join reference documentation.