Quickstart in Couchbase with Python and FastAPI

Learn to build a REST API in Python using FastAPI and Couchbase
See how you can fetch data from Couchbase using SQL++ queries
Explore CRUD operations in action with Couchbase

In this tutorial, you will learn how to connect to a Couchbase Capella cluster to create, read, update, and delete documents and how to write simple parametrized SQL++ queries.

Prerequisites

To run this prebuilt project, you will need:

Couchbase Capella cluster with travel-sample bucket loaded.
- To run this tutorial using a self managed Couchbase cluster, please refer to the appendix.
Python 3.9 or higher installed
- Ensure that the Python version is compatible with the Couchbase SDK.
Loading Travel Sample Bucket If travel-sample is not loaded in your Capella cluster, you can load it by following the instructions for your Capella Cluster:
- Load travel-sample bucket in Couchbase Capella

Note that this tutorial is designed to work with the latest Python SDK (4.x) for Couchbase. It will not work with the older Python SDK for Couchbase without adapting the code.

App Setup

Cloning Repo

git clone https://github.com/couchbase-examples/python-quickstart-fastapi.git

Install Dependencies

Any dependencies should be installed through pip, the default package manager for Python.

python -m pip install -r requirements.txt

Setup Database Configuration

To know more about connecting to your Capella cluster, please follow the instructions.

Specifically, you need to do the following:

Create the database credentials to access the travel-sample bucket (Read and Write) used in the application.
Allow access to the Cluster from the IP on which the application is running.

All configuration for communication with the database is fetched from the environment variables. We have provided a convenience feature in this quickstart to read the environment variables from a local file, .env in the source folder.

Create a copy of .env.example file & rename it to .env & add the values for the Couchbase cluster.

DB_CONN_STR=<connection_string>
DB_USERNAME=<user_with_read_write_permission_to_travel-sample_bucket>
DB_PASSWORD=<password_for_user>

Note: The connection string expects the couchbases:// or couchbase:// part.

Running the Application

Directly on Local Machine

At this point, we have installed the dependencies, loaded the travel-sample data and configured the application with the credentials. The application is now ready and you can run it.

The application will run on port 8080 of your local machine (http://localhost:8080). You will find the Swagger documentation of the API which you can use to try the API endpoints.

uvicorn app.main:app --reload

Using Docker

If you prefer to run this quick start using Docker, we have provided the Dockerfile which you can use to build the image and run the API as a container.

Build the Docker image

cd src
docker build -t couchbase-fastapi-quickstart .

Run the Docker image

docker run -it --env-file app/.env -p 8000:8000 couchbase-fastapi-quickstart

Note: The .env file has the connection information to connect to your Capella cluster. With the --env-file, docker will inject those environment variables to the container.

Verifying the Application

Once the application starts, you can see the details of the application on the logs.

The application will run on port 8000 of your local machine (http://localhost:8000). You will find the interactive Swagger documentation of the API if you go to the URL in your browser. Swagger documentation is used in this demo to showcase the different API end points and how they can be invoked. More details on the Swagger documentation can be found in the appendix.

Data Model

For this tutorial, we use three collections, airport, airline and route that contain sample airports, airlines and airline routes respectively. The route collection connects the airports and airlines as seen in the figure below. We use these connections in the quickstart to generate airports that are directly connected and airlines connecting to a destination airport. Note that these are just examples to highlight how you can use SQL++ queries to join the collections.

Let Us Review the Code

To begin this tutorial, clone the repo and open it up in the IDE of your choice. Now you can learn about how to create, read, update and delete documents in Couchbase Server.

Code Layout

Dockerfile
├── app
│   ├── config.py
│   ├── db.py
│   ├── main.py
│   ├── routers
│   │   ├── airline.py
│   │   ├── airport.py
│   │   └── route.py
│   └── tests
│       ├── conftest.py
│       ├── test_airline.py
│       ├── test_airport.py
│       └── test_route.py
└── requirements.txt

We have separated out the API code into separate files by the entity (collection) in the routers folder. The tests are similarly separated out by entity in the tests folder.

In main.py, we initialize the application including connecting to the database and add all the routes from individual API files.

We have the Couchbase SDK operations defined in the CouchbaseClient class inside the db.py file.

We recommend creating a single Couchbase connection when your application starts up, and sharing this instance throughout your application. If you know at startup time which buckets, scopes, and collections your application will use, we recommend obtaining them from the Cluster at startup time and sharing those instances throughout your application as well.

In this application, we have created the connection object in get_db method in db.py and we use this object in all of our APIs by caching the object. The object is initialized in main.py in the application lifecycle of FastAPI framework. We have also stored the reference to our bucket, travel-sample and the scope, inventory in the connection object. When the application is shutdown, we clean up the database connection by calling the close method which in turn calls the close defined in the Cluster object.

# main.py
from app.db import get_db
...

# Initialize couchbase connection
@asynccontextmanager
async def lifespan(app: FastAPI):
    """Method that gets called upon app initialization to initialize couchbase connection & close the connection on exit"""
    db = get_db()
    yield
    db.close()

# get_db method in db.py
@cache
def get_db()
    """Get Couchbase client"""
    load_dotenv()
    conn_str = os.getenv("DB_CONN_STR")
    username = os.getenv("DB_USERNAME")
    password = os.getenv("DB_PASSWORD")
    if conn_str is None:
        print("WARNING: DB_CONN_STR environment variable not set")
    if username is None:
        print("WARNING: DB_USERNAME environment variable not set")
    if password is None:
        print("WARNING: DB_PASSWORD environment variable not set")
    return CouchbaseClient(conn_str, username, password)

The Couchbase connection is established in the connect method defined in db.py. There, we call the Cluster method defined in the SDK to create the Database connection. If the connection is already established, we do not do anything. In our application, we have the same bucket and scope that is used by all the APIs. The collection will change depending on the API route.

# connect method in CouchbaseClient class in db.py
# authentication for Couchbase cluster
auth = PasswordAuthenticator(self.username, self.password)

cluster_opts = ClusterOptions(auth)
# wan_development is used to avoid latency issues while connecting to Couchbase over the internet
cluster_opts.apply_profile("wan_development")

# connect to the cluster
self.cluster = Cluster(self.conn_str, cluster_opts)

# wait until the cluster is ready for use
self.cluster.wait_until_ready(timedelta(seconds=5))

# get a reference to our bucket
self.bucket = self.cluster.bucket(self.bucket_name)

# get a reference to our scope
self.scope = self.bucket.scope(self.scope_name)

# close method in CouchbaseClient class in db.py
self.cluster.close()

Airport Entity

For this tutorial, we will focus on the airline entity. The other entities are similar.

We will be setting up a REST API to manage airport documents.

POST Airport – Create a new airport
GET Airport – Read specified airport
PUT Airport – Update specified airport
DELETE Airport – Delete airport
Airport List – Get all airports. Optionally filter the list by country
Direct Connections - Get a list of airports directly connected to the specified airport

For CRUD operations, we will use the Key-Value operations that are built into the Couchbase SDK to create, read, update, and delete a document. Every document will need an ID (similar to a primary key in other databases) to save it to the database. This ID is passed in the URL. For other end points, we will use SQL++ to query for documents.

Airport Document Structure

Our profile document will have an airportname, city, country, faa code, icao code, timezone info and the geographic coordinates. For this demo, we will store all airport information in one document in the airport collection in the travel-sample bucket.

{
  "airportname": "Sample Airport",
  "city": "Sample City",
  "country": "United Kingdom",
  "faa": "SAA",
  "icao": "SAAA",
  "tz": "Europe/Paris",
  "geo": {
    "lat": 48.864716,
    "lon": 2.349014,
    "alt": 92
  }
}

POST Airport

Open the airport.py file and navigate to the create_airport method. The json data in the body is validated using the models defined by Pydantic by FastAPI. We create a dictionary using the Pydantic model's model_dump method that we insert into the database using the datbase client, db.

# create_airport method in airport.py
db.insert_document(AIRPORT_COLLECTION, id, airport.model_dump())
return airport

We call the insert_document method in CouchbaseClient class, which calls the insert method for the collection defined in the Couchbase SDK. The insert method takes the key (ID) by which the document is referenced and the content to be inserted into the collection.

# CouchbaseClient class in db.py
def insert_document(self, collection_name: str, key: str, doc: dict):
  """Insert document using KV operation"""
  return self.scope.collection(collection_name).insert(key, doc)

GET Airport

Navigate to the read_airport method in the airport.py file. We only need the airport document ID or our key from the user to retrieve a particular airport document using a key-value operation which is passed to the get_document method. The result is converted into a python dictionary using the .content_as[dict] operation defined for the result returned by the SDK.

# read_airport method in airport.py
db.get_document(AIRPORT_COLLECTION, id).content_as[dict]

The CouchbaseClient client get_document method calls the get method defined for collections in the Couchbase SDK. We fetch the documents based on the key by which it is stored.

# CouchbaseClient class in db.py
def get_document(self, collection_name: str, key: str):
  """Get document by key using KV operation"""
  return self.scope.collection(collection_name).get(key)

If the document is not found in the database, we get an exception, DocumentNotFoundException from the SDK and return the status as 404.

PUT Airport

Update an Airport by Document ID

Navigate to update_airport method in airport.py. We use the ID value passed in via the URL to call the upsert_document method in CouchbaseClient passing it the key and the validated data provided in the request body.

# update_airport method in airport.py
db.upsert_document(AIRPORT_COLLECTION, id, airport.model_dump())
return airport

The CouchbaseClient class upsert_document method calls the upsert method defined for collection in the Couchbase SDK with the key and json data to update the document in the database.

# CouchbaseClient class in db.py
def upsert_document(self, collection_name: str, key: str, doc: dict):
  """Upsert document using KV operation"""
  return self.scope.collection(collection_name).upsert(key, doc)

DELETE Airport

Navigate to the delete_airport method in airport.py. We only need the key or document ID from the user to delete a document using the key-value operation.

# delete_airport method in airport.py
db.delete_document(AIRPORT_COLLECTION, id)

The delete_document method in CouchbaseClient class calls the remove method defined for collection in the Couchbase SDK sending the key of the document to remove from the database.

# CouchbaseClient class in db.py
def delete_document(self, collection_name: str, key: str):
    """Delete document using KV operation"""
    return self.scope.collection(collection_name).remove(key)

List Airport

This endpoint retrieves the list of airports in the database. The API has options to specify the page size for the results and country from which to fetch the airport documents.

SQL++ is a powerful query language based on SQL, but designed for structured and flexible JSON documents. We will use a SQL++ query to search for airports with Limit, Offset, and Country option.

Navigate to the get_airports_list method in the airport.py file. This endpoint is different from the others we have seen before because it makes the SQL++ query rather than a key-value operation. This usually means more overhead because the query engine is involved. For this query, we are using the predefined indices in the travel-sample bucket. We can create an additional index specific for this query to make it perform better.

First, we need to get the values from the query string for country, limit, and Offset that we will use in our query. These are pulled in by the annotations on the method in FastAPI.

This end point has two queries depending on the value for the country parameter. If a country name is specified, we retrieve the airport documents for that specific country. If it is not specified, we retrieve the list of airports across all countries. The queries are slightly different for these two scenarios.

We build our SQL++ query using the parameters specified by $ symbol for both these scenarios. The difference between the two queries is the presence of the country parameter in the query. Normally for the queries with pagination, it is advised to order the results to maintain the order of results across multiple queries.

Next, we pass that query to the CouchbaseClient class query method. We save the results in a list, airports. By default, the Python SDK will stream result set from the server. To gather all the results, we need to iterate over the results.

# get_airports_list method in airport.py

# create query
if country:
    query = """
        SELECT airport.airportname,
            airport.city,
            airport.country,
            airport.faa,
            airport.geo,
            airport.icao,
            airport.tz
        FROM airport AS airport
        WHERE airport.country = $country
        ORDER BY airport.airportname
        LIMIT $limit
        OFFSET $offset;
    """
else:
    query = """
        SELECT airport.airportname,
            airport.city,
            airport.country,
            airport.faa,
            airport.geo,
            airport.icao,
            airport.tz
        FROM airport AS airport
        ORDER BY airport.airportname
        LIMIT $limit
        OFFSET $offset;
    """

# run query
result = db.query(query, country=country, limit=limit, offset=offset)
# gather all documents
airports = [r for r in results]
return airports

The query method in the CouchbaseClient class executes the SQL++ query using the query method defined in the Scope by the Couchbase SDK.

# CouchbaseClient class in db.py
def query(self, sql_query, *options, **kwargs):
  """Query Couchbase using SQL++"""
  # options are used for positional parameters
  # kwargs are used for named parameters
  return self.scope.query(sql_query, *options, **kwargs)

Direct Connections

This endpoint fetches the airports that can be reached directly from the specified source airport code. This also uses a SQL++ query to fetch the results simlar to the List Airport endpoint.

Let us look at the query used here:

SELECT distinct (route.destinationairport)
FROM airport as airport
JOIN route as route on route.sourceairport = airport.faa
WHERE airport.faa = $airport and route.stops = 0
ORDER BY route.destinationairport
LIMIT $limit
OFFSET $offset

We are fetching the direct connections by joining the airport collection with the route collection and filtering based on the source airport specified by the user and by routes with no stops.

Running Tests

We have defined integration tests using pytest for all the API end points. The integration tests use the same database configuration as the application. For the tests, we perform the operation using the API and confirm the results by checking the documents in the database. For example, to check the creation of the document by the API, we would call the API to create the document and then read the same document directly from the database using the CouchbaseClient and compare them. After the tests, the documents are cleaned up.

The tests including the fixtures and helpers for the tests are configured in the conftest.py file in the tests folder.

To run the tests, use the following command:

python -m pytest

Appendix

Extending API by Adding New Entity

If you would like to add another entity to the APIs, these are the steps to follow:

Create the new entity (collection) in the Couchbase bucket. You can create the collection using the SDK or via the Couchbase Server interface.
Define the routes in a new file in the routers folder similar to the existing routes like airport.py.
Add the new routes to the application in app.py.
Add the tests for the new routes in a new file in the tests folder similar to test_airport.py.

Running Self Managed Couchbase Cluster

If you are running this quickstart with a self managed Couchbase cluster, you need to load the travel-sample data bucket in your cluster and generate the credentials for the bucket.

Follow Couchbase Installation Options for installing the latest Couchbase Database Server Instance.

You need to update the connection string and the credentials in the .env file in the source folder.

Note: Couchbase Server must be installed and running prior to running the FastAPI Python app.

Swagger Documentation

Swagger documentation provides a clear view of the API including endpoints, HTTP methods, request parameters, and response objects.

Click on an individual endpoint to expand it and see detailed information. This includes the endpoint's description, possible response status codes, and the request parameters it accepts.

Trying Out the API

You can try out an API by clicking on the "Try it out" button next to the endpoints.

Parameters: If an endpoint requires parameters, Swagger UI provides input boxes for you to fill in. This could include path parameters, query strings, headers, or the body of a POST/PUT request.
Execution: Once you've inputted all the necessary parameters, you can click the "Execute" button to make a live API call. Swagger UI will send the request to the API and display the response directly in the documentation. This includes the response code, response headers, and response body.

Models

Swagger documents the structure of request and response bodies using models. These models define the expected data structure using JSON schema and are extremely helpful in understanding what data to send and expect.

Contents