MicroStrategy ONE

Import Data by Scraping a Wikipedia Page for Public Data

If you arrived here from Workstation, see the Workstation Document Authoring Help.

You can import data by extracting data from, or scraping, a Wikipedia page. The system imports the data as HTML tables. You can use web scraping to identify changes to pages.

Provide the URL for the Wikipedia page you want to import. If you have not identified a specific Wikipedia page, or want to research a topic, you can search Wikipedia for a topic, and from the results, choose the HTML tables in the pages for import.

  1. Create a blank dashboard or open an existing one.
  2. Choose Add Data > New Data to import data into a new dataset.

    or

    In the Datasets panel, click More next to the dataset name and choose Edit Dataset to add data to the dataset. The Preview Dialog opens. Click Add a new table.

    The Data Sources dialog opens.

  1. Click Public Data. The Public Data dialog opens.
  2. Enter your search text in Search for data to search for Wikipedia data.

    Enter states or weather forecast to search for a list of HTML tables that contain a list of states or weather forecast information.

    or

    Enter a URL in Search for data to import data from the corresponding Wikipedia page.

  3. Click Search. The HTML tables in the corresponding Wikipedia pages appear in a list.
  4. Hover over a link in the Source column to preview the corresponding table.
  5. Click a link in the Source column to view the corresponding table in your browser.
  6. Select the checkboxes that correspond with the tables you want to import.
  7. or

    Select the checkbox in the column header to select all tables for import.

  1. Click Prepare Data if you are adding a new dataset and want to preview, modify, and specify import options.
  2. or

    Click Add if you are editing an existing dataset.

  3. Click Finish if you are adding a new dataset.
  4. or

    Click Update Dataset if you are editing an existing dataset.

    The data imports into a new dataset or updates an existing dataset.

Related Topics

Import Data

Best Practices for Importing Data from a File

Create a Dashboard