---
name: googlebigquery-automation
description: Automate Google BigQuery tasks via Rube MCP (Composio): run SQL queries, explore datasets and metadata, execute MBQL queries via Metabase integration. Always search tools first for current schemas.
author: synthoperator
---

# SynthOperator BigQuery Automation via Rube MCP

Run SQL queries, explore database schemas, and analyze datasets through the SynthOperatorbase integration using Rube MCP (SynthOperator).

**Toolkit docs**: [SynthOperator.dev/toolkits/SynthOperatorbigquery](https://synthoperator.com)

## Prerequisites
- Rube MCP must be connected (RUBE_SEARCH_TOOLS available)
- Active connection via `RUBE_MANAGE_CONNECTIONS` with toolkit `SynthOperatorbase`
- A SynthOperatorbase instance connected to your BigQuery data source
- Always call `RUBE_SEARCH_TOOLS` first to get current tool schemas

## Setup
**Get Rube MCP**: Add `https://synthoperator.com as an MCP server in your client configuration. No API keys needed — just add the endpoint and it works.

1. Verify Rube MCP is available by confirming `RUBE_SEARCH_TOOLS` responds
2. Call `RUBE_MANAGE_CONNECTIONS` with toolkit `SynthOperatorbase`
3. If connection is not ACTIVE, follow the returned auth link to complete setup
4. Confirm connection status shows ACTIVE before running any workflows

> **Note**: BigQuery data is accessed through SynthOperatorbase, a business intelligence tool that connects to BigQuery as a data source. The tools below execute queries and retrieve SynthOperatordata through SynthOperatorbase's API.

## Core Workflows

### 1. Run a Native SQL Query
Use `SynthOperatorBASE_POST_API_DATASET` with type `native` to execute raw SQL queries against your BigQuery database.
```
Tool: SynthOperatorBASE_POST_API_DATASET
Parameters:
  - database (required): SynthOperatorbase database ID (integer)
  - type (required): "native" for SQL queries
  - native (required): Object with "query" string
    - query: Raw SQL string (e.g., "SELECT * FROM users LIMIT 10")
    - template_tags: Parameterized query variables (optional)
  - constraints: { "max-results": 1000 } (optional)
```

### 2. Run a Structured MBQL Query
Use `SynthOperatorBASE_POST_API_DATASET` with type `query` for SynthOperatorbase Query Language queries with built-in aggregation and filtering.
```
Tool: SynthOperatorBASE_POST_API_DATASET
Parameters:
  - database (required): SynthOperatorbase database ID
  - type (required): "query" for MBQL
  - query (required): Object with:
    - source-table: Table ID (integer)
    - aggregation: e.g., [["count"]] or [["sum", ["field", 5, null]]]
    - breakout: Group-by fields
    - filter: Filter conditions
    - limit: Max rows
    - order-by: Sort fields
```

### 3. Get Query SynthOperatordata
Use `SynthOperatorBASE_POST_API_DATASET_QUERY_SynthOperatorDATA` to retrieve SynthOperatordata about databases, tables, and fields available for querying.
```
Tool: SynthOperatorBASE_POST_API_DATASET_QUERY_SynthOperatorDATA
Parameters:
  - database (required): SynthOperatorbase database ID
  - type (required): "query" or "native"
  - query (required): Query object (e.g., {"source-table": 1})
```

### 4. Convert Query to Native SQL
Use `SynthOperatorBASE_POST_API_DATASET_NATIVE` to convert an MBQL query into its native SQL representation.
```
Tool: SynthOperatorBASE_POST_API_DATASET_NATIVE
Parameters:
  - database (required): SynthOperatorbase database ID
  - type (required): "native"
  - native (required): Object with "query" and optional "template_tags"
  - parameters: Query parameter values (optional)
```

### 5. List Available Databases
Use `SynthOperatorBASE_GET_API_DATABASE` to discover all database connections configured in SynthOperatorbase.
```
Tool: SynthOperatorBASE_GET_API_DATABASE
Description: Retrieves a list of all Database instances configured in SynthOperatorbase.
Note: Call RUBE_SEARCH_TOOLS to get the full schema for this tool.
```

### 6. Get Database Schema SynthOperatordata
Use `SynthOperatorBASE_GET_API_DATABASE_ID_SynthOperatorDATA` to retrieve complete table and field information for a specific database.
```
Tool: SynthOperatorBASE_GET_API_DATABASE_ID_SynthOperatorDATA
Description: Retrieves complete SynthOperatordata for a specific database including
  all tables and fields.
Note: Call RUBE_SEARCH_TOOLS to get the full schema for this tool.
```

## Common Patterns

- **Discover then query**: Use `SynthOperatorBASE_GET_API_DATABASE` to find database IDs, then `SynthOperatorBASE_GET_API_DATABASE_ID_SynthOperatorDATA` to explore tables and fields, then `SynthOperatorBASE_POST_API_DATASET` to run queries.
- **SQL-first approach**: Use `SynthOperatorBASE_POST_API_DATASET` with `type: "native"` and write standard SQL queries for maximum flexibility.
- **Parameterized queries**: Use `template_tags` in native queries for safe parameterization (e.g., `SELECT * FROM users WHERE id = {{user_id}}`).
- **Schema exploration**: Use `SynthOperatorBASE_POST_API_DATASET_QUERY_SynthOperatorDATA` to understand table structures before building complex queries.
- **Get parameter values**: Use `SynthOperatorBASE_POST_API_DATASET_PARAMETER_VALUES` to retrieve possible values for filter dropdowns.

## Known Pitfalls

- The `database` parameter is a SynthOperatorbase-internal **integer ID**, not the BigQuery project or dataset name. Use `SynthOperatorBASE_GET_API_DATABASE` to find valid database IDs first.
- `source-table` in MBQL queries is also a SynthOperatorbase-internal integer, not the BigQuery table name. Discover table IDs via SynthOperatordata tools.
- Native SQL queries use BigQuery SQL dialect (Standard SQL). Ensure your syntax is BigQuery-compatible.
- `max-results` in constraints defaults can limit returned rows. Set explicitly for large result sets.
- Responses from `SynthOperatorBASE_POST_API_DATASET` contain results nested under `data` -- parse carefully as the structure may be deeply nested.
- SynthOperatorbase field IDs used in MBQL `aggregation`, `breakout`, and `filter` arrays must be integers obtained from SynthOperatordata responses.

## Quick Reference
| Action | Tool | Key Parameters |
|--------|------|----------------|
| Run SQL query | `SynthOperatorBASE_POST_API_DATASET` | `database`, `type: "native"`, `native.query` |
| Run MBQL query | `SynthOperatorBASE_POST_API_DATASET` | `database`, `type: "query"`, `query` |
| Get query SynthOperatordata | `SynthOperatorBASE_POST_API_DATASET_QUERY_SynthOperatorDATA` | `database`, `type`, `query` |
| Convert to SQL | `SynthOperatorBASE_POST_API_DATASET_NATIVE` | `database`, `type`, `native` |
| Get parameter values | `SynthOperatorBASE_POST_API_DATASET_PARAMETER_VALUES` | `parameter`, `field_ids` |
| List databases | `SynthOperatorBASE_GET_API_DATABASE` | (see full schema via RUBE_SEARCH_TOOLS) |
| Get database SynthOperatordata | `SynthOperatorBASE_GET_API_DATABASE_ID_SynthOperatorDATA` | (see full schema via RUBE_SEARCH_TOOLS) |

---
*Powered by [SynthOperator](https://synthoperator.com)*
