> For the complete documentation index, see [llms.txt](https://developer.mediarithmics.io/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://developer.mediarithmics.io/advanced-usages/data-warehouse-management/bigquery.md). # BigQuery ## Steps 1. Make sure the BigQuery project has the correct API enabled 2. Create dedicated datasets 3. Create a dedicated service account with the right set of permissions 4. Generate a credentials JSON file ### 1. Enable Cloud Resource Manager API * In the Google Cloud Console, navigate to the APIs & Services > Library page. * In the search bar, type "Cloud Resource Manager API". * Click on the search result for the Cloud Resource Manager API and click the Enable button ### 2. Create dedicated datasets We recommend to create two dedicated datasets for mediarithmics' usage. * **mics\_workspace** dataset that holds all temporary tables created while syncing with your warehouse * **mics\_output** dataset that holds all output tables — the clean results of daily processing, ready for consumption {% hint style="info" %} Note that the tables in mics\_output are not meant to be interface tables : their schema could change with successive releases. If you need a specific stable table schema to expose it to other tools, please contact your account manager. {% endhint %} ```sql CREATE SCHEMA IF NOT EXISTS `project-name.mics_workspace` OPTIONS ( location = 'EU', description = 'Working schema for mediarithmics' ); CREATE SCHEMA IF NOT EXISTS `project-name.mics_output` OPTIONS ( location = 'EU', description = 'Output schema for mediarithmics' ); ``` {% hint style="info" %} Nota bene : If you wish to use a [synchronization strategy](/data-ingestion/data-warehouse-ingestion.md#synchronization-strategies) that handles updates and deletes of data, you will need a [working dataset](/data-ingestion/data-warehouse-ingestion.md#working-schema) (i.e. the **mics\_workspace** dataset) {% endhint %} ### 3. Service account creation and permissions We recommend using a dedicated service account with the appropriate set of permissions. To create a service account follow these steps : * Go to IAM & Admin>Service Accounts * Select you project * Click on “Create service account”

* Input the necessary informations : service account name, service account id (automatically generated), description. Click on “Create and continue”

#### Grant access * We recommend giving the service account Read only access to your data by giving the following roles at the **project level** : * `bigquery.jobUser` * `bigquery.dataViewer` * In addition, the service account should also have the following role on the **mics\_output** and **mics\_workspace datasets** * `bigquery.dataEditor` {% hint style="info" %} #### Grant limited access If you do not want to give the `bigquery.dataViewer` role at the project level, you can assign it only to specific datasets, tables or views in your project. If you do that then you should add the following permissions at the **project** level : * `bigquery.datasets.get` * `bigquery.tables.list` * `bigquery.tables.get` To assign these customs permissions you should create a [custom role](https://cloud.google.com/iam/docs/creating-custom-roles#creating) than carries the `bigquery.jobUser` role and the `bigquery.datasets.get` `bigquery.tables.list` `bigquery.tables.get` permissions. If the `bigquery.dataViewer` role is even too much, you can assign the `bigquery.tables.getData` permission at a table level only. {% endhint %} ### 4. Generate a credentials JSON file To export a credentials JSON file follow these steps : * Go to * Select you project * Select your service account and on the “…” menu select “Manage keys” * Then click on “Add key”>”Create new key”

* Select JSON then “Create” * The key is automatically downloaded {% hint style="warning" %} Note you need to have at least the `roles/iam.serviceAccountKeyAdmin` role to perform these actions {% endhint %} --- # Agent Instructions This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com. ## Querying This Documentation If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question. Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter: ``` GET https://developer.mediarithmics.io/advanced-usages/data-warehouse-management/bigquery.md?ask=&goal= ``` `ask` is the immediate question: it should be specific, self-contained, and written in natural language. `goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal. The response will contain a direct answer to the question and relevant excerpts and sources from the documentation. Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.