- What is JSON?
- Converting JSON string to Python object
- Converting JSON file to Python object
- Converting Python object to JSON string
- Writing Python object to a JSON file
- Converting custom Python objects to JSON objects
- Creating Python class objects from JSON objects
JSON is a common standard used by websites and APIs and even natively supported by modern databases such as PostgreSQL. In this article, we’ll present a tutorial on how to handle JSON data with Python
For a detailed explanation, see our blog post.
JSON, or JavaScript Object Notation, is a format that uses text to store data objects:
{
   "name": "United States",
   "population": 331002651,
   "capital": "Washington D.C.",
   "languages": [
      "English",
      "Spanish"
   ]
}Let’s start with a simple example:
# JSON string
country = '{"name": "United States", "population": 331002651}'
print(type(country))The output of this snippet will confirm that this is indeed a string:
<class 'str'>We can call the json.loads() method and provide this string as a parameter.
import json
country = '{"name": "United States", "population": 331002651}'
country_dict = json.loads(country)
print(type(country))
print(type(country_dict))The output of this snippet will confirm that the JSON data, which was a string, is now a Python dictionary.
<class 'str'>
<class 'dict'>This dictionary can be accessed as usual:
print(country_dict['name'])
# OUTPUT:   United StatesIt is important to note here that the json.loads() method will not always return a dictionary. The data type that is returned will depend on the input string. For example, this JSON string will return a list, not a dictionary.
countries = '["United States", "Canada"]'
counties_list= json.loads(countries)
print(type(counties_list))
# OUTPUT:  <class 'list'>
Similarly, if the JSON string contains true, it will be converted to Python equivalent boolean value, which is True.
import json
 
bool_string = 'true'
bool_type = json.loads(bool_string)
print(bool_type)
# OUTPUT:  True
The following table shows JSON objects and the Python data types after conversion. For more details, see Python docs.
Save the following JSON data as a new file and name it united_states.json:
{
   "name": "United States",
   "population": 331002651,
   "capital": "Washington D.C.",
   "languages": [
      "English",
      "Spanish"
   ]
}Enter this Python script in a new file:
import json
with open('united_states.json') as f:
  data = json.load(f)
print(type(data))Running this Python file prints the following:
<class 'dict'>
The dictionary keys can be checked as follows:
print(data.keys())
# OUTPUT:  dict_keys(['name', 'population', 'capital', 'languages'])
Using this information, the value of name can be printed as follows:
data['name']
# OUTPUT:  United StatesSave this code in a new file as a Python script:
import json
languages = ["English","French"]
country = {
    "name": "Canada",
    "population": 37742154,
    "languages": languages,
    "president": None,
}
country_string = json.dumps(country)
print(country_string)When this file is run with Python, the following output is printed:
{"name": "Canada", "population": 37742154, "languages": ["English", "French"],
 "president": null}Lists can be converted to JSON as well. Here is the Python script and its output:
import json
languages = ["English", "French"]
languages_string = json.dumps(languages)
print(languages_string)
# OUTPUT:   ["English", "French"]It’s not just limited to a dictionary and a list. string, int, float, bool and even None value can be converted to JSON.
The method used to write a JSON file is dump():
import json
# Tuple is encoded to JSON array.
languages = ("English", "French")
# Dictionary is encoded to JSON object.
country = {
    "name": "Canada",
    "population": 37742154,
    "languages": languages,
    "president": None,
}
with open('countries_exported.json', 'w') as f:
    json.dump(country, f)To make it more readable, we can pass one more parameter to the dump() function as follows:
json.dump(country, f, indent=4)This time when you run the code, it will be nicely formatted with indentation of 4 spaces:
{
    "languages": [
        "English", 
        "French"
    ], 
    "president": null, 
    "name": "Canada", 
    "population": 37742154
}Save the following code as a Python script and run it:
import json
class Country:
    def __init__(self, name, population, languages):
        self.name = name    
        self.population = population
        self.languages = languages
    
canada = Country("Canada", 37742154, ["English", "French"])
print(json.dumps(canada))
# OUTPUT:   TypeError: Object of type Country is not JSON serializableTo convert the objects to JSON, we need to write a new class that extends JSONEncoder:
import json 
 
class CountryEncoder(json.JSONEncoder):
    def default(self, o): 
        if isinstance(o, Country):
           # JSON object would be a dictionary.
						return {
                "name" : o.name,
                "population": o.population,
                "languages": o.languages
            } 
        else:
            # Base class will raise the TypeError.
            return super().default(o)This class can now be supplied to the json.dump() as well as json.dumps() methods.
print(json.dumps(canada, cls=CountryEncoder))
# OUTPUT:  {“name": "Canada", "population": 37742154, "languages": ["English", "French"]}Using a custom encoder, we were able to write code like this:
# Create an object of class Country
canada = Country("Canada", 37742154, ["English", "French"])
# Use json.dump() to create a JSON file in writing mode
with open('canada.json','w') as f:
    json.dump(canada,f, cls=CountryEncoder)If we try to parse this JSON file using the json.load() method, we will get a dictionary:
with open('canada.json','r') as f:
    country_object = json.load(f)
# OUTPUT:  <type ‘dict'>To get an instance of the Country class instead of a dictionary, we need to create a custom decoder:
import json
 
class CountryDecoder(json.JSONDecoder):
    def __init__(self, object_hook=None, *args, **kwargs):
        super().__init__(object_hook=self.object_hook, *args, **kwargs)
    def object_hook(self, o):
        decoded_country =  Country(
            o.get('name'), 
            o.get('population'), 
            o.get('languages'),
        )
        return decoded_countryFinally, we can call the json.load() method and set the cls parameter to CountryDecoder class.
with open('canada.json','r') as f:
    country_object = json.load(f, cls=CountryDecoder)
print(type(country_object))
# OUTPUT:  <class ‘Country'>If you wish to find out more about Reading & Parsing JSON Data With Python, see our blog post.
