Google DataSet Search: The New Dataset Indexer

A collection of data related to the UK.
Post Reply
bitheerani319
Posts: 854
Joined: Mon Dec 23, 2024 3:33 am

Google DataSet Search: The New Dataset Indexer

Post by bitheerani319 »

Google recently launched datasetsearch , a free tool for searching 25 million publicly available datasets.

The search tool includes filters to limit results based on their license (free or paid), format (csv, images, etc.), and update time.

The results also include descriptions of the dataset contents as well as author citations.

Google’s dataset aggregation methodology rcs data belarus from other dataset repositories, such as Amazon’s Open Data Registry. Unlike other repositories that organize and host their own datasets, Google does not curate or provide direct access to the 25 million datasets directly.

Instead, Google relies on dataset publishers to use open schema.org standards to describe the metadata of those datasets. Google then indexes and makes that metadata searchable across publishers.

Since publishers still need to host their own datasets, for-profit publishers that comply with schema.org standards will also have their datasets indexed by Google.

Currently, about half of the datasets in search results are from for-profit aggregators, with an even higher percentage when searching for market-related datasets.

Other popular dataset publishers on the platform include government agencies and research institutions. Google says that US government agencies alone have published more than 2 million datasets.

According to Google, most of the datasets are related to “geosciences, biology, and agriculture.”

To publish your own datasets, you can simply use the open standards from schema.org. The number of publicly available datasets is expected to continue to grow as more publishers comply with the standard.
Post Reply