MicroStrategy ONE

Connect to Google Cloud Storage

Google Cloud Storage is an online service for storing and accessing data in the Google Cloud Platform. MicroStrategy Cloud Object Connector provides access to Google Cloud Storage to quickly browse folders and files and import them into MicroStrategy cubes.

Explore the following topics on this page:

  1. Prepare Connection Parameters
  2. Create a DBRole
  3. Import Data
  4. Limitations

Prepare Connection Parameters

For Cloud Object Connector to successfully browse the Google Cloud Storage file system, you need a service account created with proper permissions. Permission combinations are not available from a basic role in Google Cloud. MicroStrategy recommends creating a custom role for the Google Cloud connector.

The following permissions should be added to the custom role:

  • storage.buckets.get
  • storage.buckets.list
  • storage.objects.get
  • storage.objects.list

After the service account is successfully created, please ask your administrator for the key. The key is in the form of a JSON file and will be needed for future use.

Create a DBRole

Access the Google Cloud Storage Cloud Object Connector in MicroStrategy Web or Workstation.

  1. Choose Add Data > New Data.
  2. Find and select the Google Cloud Storage Cloud Object connector from the data source list.

  3. Next to Data Sources, click New Data Source to add a new connection.

  4. Enter your connection credentials.

    • Connection Name: A name for the new connection.
    • Project ID: The project ID from the service account key JSON file obtained from your administrator.
    • Client ID: The client ID from the service account key JSON file obtained from your administrator.
    • Client Email: The client email from the service account key JSON file obtained from your administrator.
    • Private Key ID: The private key ID from the service account key JSON file obtained from your administrator.
    • Private Key: The private key from the service account key JSON file obtained from your administrator.

      The private key is required to be in between double quotations (e.g., "your_private_key").

  1. In the left panel, next to Datasets, click Create a new dataset .

  2. Find and select Google Cloud Storage Cloud Object connector from the data source list.

  3. Next to Data Sources, click New Data Source to add a new connection.

  4. Enter your connection credentials.

    • Connection Name: A name for the new connection
    • Billing Project: The project ID from the service account key JSON file obtained from your administrator
    • Authentication Method: The authentication method you want to use
    • Client ID: The client ID from the service account key JSON file obtained from your administrator
    • Client Secret The secret of the client from the service account key JSON file obtained from your administrator
    • Callback URL: The URL that is invoked after user login

Import Data

Once you've successfully created the connector, you can import data into MicroStrategy.

  1. Select the newly created connection.
  2. In the drop-down, select the bucket and browse the folders or files.
  3. Double-click files or drag them into the right pane.

    In the Preview pane, you can see the sample data and adjust the column type.

  4. Publish the cube to MicroStrategy with your selected data.

Limitations

Supported File Types

Only the following file types are supported:

  • .json
  • .parquet
  • .avro
  • .orc
  • .csv
  • Delta format

Select Folders

When selecting the entire folder, the folder must meet the following requirements:

  • All files under the folder need to have the same file types. A dialog will prompt you to choose the file type
  • All files share the same schema
  • If the folder has sub-folders, the sub-folders should be in a valid partitioned format. The following is an example of a valid folder structure

Spark Limitations

  • Only JSON files with each newline as a complete JSON can be read
  • Parquet files that contain special characters (, ; { } \ = " .) cannot be read
  • Parquet files with column data types as INT32(UINT_8)/(UNIT_16)/(UNIT_32)/(TIME_MILLIS) are not supported
  • Columns with the binary type cannot be published in cube
  • ORC files with field names prefixed with "_col" (e.g., _col0, _col1), where the file schema contains at least one nested structure, array, or map field, cannot be imported

Features

The following features are not supported:

  • Uploading MicroStrategy files connecting to Cloud Object Connector
  • Data wrangling in Data Import
  • Defining geography in Data Import
  • Advanced scheduling for Schedule Cube Publish
  • Group tables in Data Import