Skip to content
This repository was archived by the owner on Jun 14, 2024. It is now read-only.

Azure-Samples/microsoft-academic-graph-pyspark-samples

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
page_type languages products description urlFragment
sample
python
azure
PySpark examples running on Azure Databricks to analyze sample Microsoft Academic Graph Data on Azure storage.
microsoft-academic-graph-pyspark-samples

Microsoft Academic Graph PySpark Samples

PySpark examples running on Azure Databricks to analyze sample Microsoft Academic Graph Data on Azure storage.

Prerequisites

Before running these examples, you need to complete the following setups:

Gather the information that you need

Before you begin, you should have these items of information:

✔️ The name of your Azure Storage (AS) account containing MAG dataset from Get Microsoft Academic Graph on Azure storage.

✔️ The access key of your Azure Storage (AS) account from Get Microsoft Academic Graph on Azure storage.

✔️ The name of the container in your Azure Storage (AS) account containing MAG dataset.

✔️ The name of the output container in your Azure Storage (AS) account.

Quickstart

  1. git clone https://github.com/Azure-Samples/microsoft-academic-graph-pyspark-samples.git

  2. Follow instructions in PySpark analytics samples for Microsoft Academic Graph to run PySpark scripts in this repository.

Resources

About

Sample PySpark code for interacting with the Microsoft Academic Graph

Resources

License

Code of conduct

Contributing

Stars

Watchers

Forks

Packages

No packages published

Contributors 6