HATEOAS in the context of microservices

Published 2020-07-03

In the rise of RESTful, microservice-based architectures in our web applications, HATEOAS seems to have fallen out of favour. This is curious, given that the creator of REST specifically advocates for HATEOAS, going so far to say that REST isn’t truly REST without it. There have also been articles disparaching it as a practice, claiming that there isn’t a sensible alternative to the current practices.

So this article will re-evaluate HATEOAS’s usefulness, specifically in the context of modern microservices. I’ll quickly outline what HATEOAS is first, and then walk through a case study which will justify its usage.

What is HATEOAS

HATEOAS is a long, needlessly intimidating acronym for *Hypermedia as the Engine of Application State*.

Hypermedia is:

…an extension of the term hypertext, […] a nonlinear medium of information that includes graphics, audio, video, plain text and hyperlinks.

and hypertext is just text that you can view on your computer screen.

Bringing this all together, HATEOAS forms a principle that application state should be represented and – crucially – discovered by resources on the web. These resources are themselves hypermedia (e.g. web content such as JSON documents, located by URIs).

Those who interact with a RESTful API should be able to navigate through it like a text-based adventure:

$ a-text-adventure-game
  You are in an empty room with a door and an open window. What do you do?
    [1] Open door
    [2] Go to window
  > 2
  You are now at the window. What do you do?
    [1] Close window
    [2] Go back to centre of the room
  ...

Each “response” gives you the set of “valid” moves that can be made. You follow the moves and get another set of “valid” moves based on where you are currently at.

There are already clear cases where we use HATEOAS at least partially. When you want to present an image as part of your API, you give a URL to the source, rather than a binary string representation of the raw image itself. The same goes for other file types: PDFs, videos, music, etc… – hypermedia is the most versatile way we have of sharing (references to) content, so why not go further in using this?

The text-based adventure above in a RESTful API could look a bit like this:

// POST /
// Content-Type: application/vnd.atextadventuregame+json
{
    "session_id": "d62230ed-71f7-4fe2-8313-8b0f13e8c2a9"
    "message": "You are in an empty room with a door and an open window. What do you do?",
    "options": [
        {
            "id": "1",
            "label": "Close window",
            "_link": "/sessions/d62230ed-71f7-4fe2-8313-8b0f13e8c2a9/1"
        },
        {
            "id": "2",
            "label": "Go back to centre of the room",
            "_link": "/sessions/d62230ed-71f7-4fe2-8313-8b0f13e8c2a9/2"
        }
    ]
}


// POST /sessions/d62230ed-71f7-4fe2-8313-8b0f13e8c2a9/2
// Content-Type: application/vnd.atextadventuregame+json
{
    "session_id": "d62230ed-71f7-4fe2-8313-8b0f13e8c2a9"
    "message": "You are now at the window. What do you do?",
    "options": [
        {
            "id": "1",
            "label": "Open door",
            "_link": "/sessions/d62230ed-71f7-4fe2-8313-8b0f13e8c2a9/1"
        },
        {
            "id": "2",
            "label": "Go to window",
            "_link": "/sessions/d62230ed-71f7-4fe2-8313-8b0f13e8c2a9/2"
        }
    ]
}

Let’s look into a more realistic example.

Worked example: a retail product platform

Suppose that we have a tech company called Products-R-Us. It provides a platform for promoting retail products: people can create product catalogs to manage the products they sell, manage stock inventory and faciliate payments to sell their stock. We can easily imagine that this would have at least 3 different (micro)services built to handle each responsibility:

CatalogService - create and manage products and catalogs
InventoryService - manage stock inventory
PaymentService - manage payments

Products-R-Us exposes RESTful APIs to each of these services so that developers can build their own tools and UI around their platform, and they are fairly successful with this business strategy.

They soon hire data scentists to help expand their business using machine learning models. The goal is to build a product recommendations engine: clients can find out what products tend to be bought by people who bought product X. This results in a new microservice: RecommendationsService. This too will be exposed as a REST HTTP service.

So the question is this: how much data should we be exposing on our endpoints? Take GET /recommendations/products/:product_id. The endpoint will be used in a context where the client will present the recommended product to the customer. So on its face, we should be including things like:

product_name
thumbnail
price
merchant

But there are many use cases that would require us to fetch information from other microservices:

A client wanting to only recommend products that are in stock might ask to include in_stock as a property from the InventoryService.
A client wanting to show the available ways to pay for the product (e.g. debit card, Paypal, payment plan, etc…) may want payment_options as a property from the PaymentsService.
A client wanting to only recommend products that have good ratings may want to include average_rating and total_reviews as properties on each response object (with data from the CatalogService).
In the future we may have a DiscountsService which calculates applicable discounts to products offered on the platform. A client wanting to recommend products with said discounts may want a discounts property. Would we need to add this on later?

and so on, and so forth. If we implemented each of these features, not only would the microservice be bloated, but it would also have an incredibly brittle contract. The data it returns is largely dependent on downstream services which can vary for wildly very different reasons to this recommendations engine. A change in the way that stock is presented by the InventoryService would completely break the recommendations engine, when these two concepts are entirely unrelated from each other!

With HATEOAS, we can keep this coupling loose and let our response format be fairly lean. If we simply have references to the various resources, then we can let them vary independently, and allow clients to simply crawl the links for the data that they need.

// GET /recommendations/products/1
// Content-Type: application/vnd.productsrus+json
{
    "results": [
        {
            "product_id": "2",
            "relevance_score": 0.9,
            "_links": {
                "product": "/products/2",
                "reviews": "/reviews/products/2",
                "stock": "/inventory/products/2",
                "recommendations": "/recommendations/products/2"
            }
        },
        {
            "product_id": "70",
            "relevance_score": 0.4,
            "_links": {
                "product": "/products/70",
                "reviews": "/reviews/products/70",
                "stock": "/inventory/products/70",
                "recommendations": "/recommendations/products/70"
            }
        }
    ],
    "_links": {
        "products": "/products?id=2&id=70",
        "reviews": "/reviews/products?id=2&id=70",
        "stock": "/inventory/products?id=2&id=70"
    }
}

A client integrating this API would hit the endpoint to get the list of recommendations, and then follow the _links to build up any information that is necessary for enhancing their recommendation or presenting data in their view layer. If they want the full product information for the nth product returned, they visit data[n]._links.product. If they want all of the product information for all of the products returned, rather than hitting each individual product link, they can visit _links.products to get an appropriate collection of just the relevant products.

Notice the benefits that we get here:

Adding references to more resources is simply a case of adding more _links to the response. So if I want to add payment_options in the future, I just give a _link to that resource in the PaymentsService or CatalogService (whichever is most relevant).
There is a clear separation between data that is crucial to the service and metadata that is potentially useful (but very use-case specific).
Because the client just follows the links to get to the desired resources, you have more flexibility to evolve the links if you need to in the future. Note that it is okay to evolve the API, and this is a particularly forgiving way of doing so.
We’re giving the client the whole proverbial “fishing rod”: by hitting the relevant microservice for their desired information, they get everything they need when they need it, without us having to directly support it in this service. Crucially: if a downstream service has a new requirement resulting in adding a property to a resource, that change doesn’t ripple through to this service.
Also, a client that is primarily concerned with the recommendations API will not be affected by changes to the URL structure of the downstream services. This is why we don’t simply leave the product_id as a seed and tell the client to read the documentation to find other services - this abstracts yet another detail of the system from the client, making it easier to use. Other clients which do have a knowledge of said services won’t be so lucky though, but this still minimizes some of the risk involved in changing your links if you need to.

This is only possible however, if the underlying microservices provide a suitable level of querying for these desired resources. Given that we’re just talking about filtering by ID or multiple IDs though, this isn’t too bad – your clients will thank you for making your services easy to query!

Furthermore, if it is absolutely necessary that an endpoint return all of the data in one go, then a composite service can be built which aggregates the data together into a single endpoint (by following the links and combining the data together before returning the response). This effectively means building a facade to hide the complexity of the link crawling and composition; however, we gain the benefit of keeping our original service small, stable and flexible. I will close this article by addressing some dangling criticisms of HATEOAS.

HATEOAS is not about eliminating external documentation and embedding said documentation into the the responses served.

This would be like saying writing self-describing code over having extensive code comments has the end-goal of making documentation obselete. Even with self-describing class names, methods and comprehensive tests, you will still need external documentation to tell users how to use the system for different use cases. Self-describing code reveals intent, is easier to understand and change; self-describing API responses reduce the cognitive overhead of interpreting the data and reveal the relationship between different resources without directly coupling their data.

There are generally two different kinds of documentation:

Documentation which communicates to the developer(s) why certain design decisions were made, so that in the future it is easier to make changes to the codebase.
Documentation which is for the consumer of the API, which is not meant to explain the design decisions or reveal implementation details, but to make the API as easy as possible to use.

HATEOAS helps with (1), but has only partial effects on (2).

HATEOAS is not a silver bullet for versioning your API, nor is it intended to be.

So if you decide to add, rename or remove properties to/from your API responses, the consumers of your API will need to know whether or not the changes will break their current integrations. This is unavoidable. The changes within the API can be minimized by the fact that you’re referencing resources, so the changes will be on the resources themselves: but the changes ultimately need to be communicated to your consumers.

HATEOAS doesn’t require concensus over naming standards in order to be useful: simply a situation where resources are better served as external references instead of embedded content.

It doesn’t matter whether you decide to name your properties using _links or _link, or adopt HAL or the JSON API spec or anything else. These are conventions designed to align with how HTML hyperlinks are designed, but every REST API will require clients to understand particular aspects of the responses (e.g. knowing that they are hitting a product catalog service, and hence that they need to find a product_id in the response body). It is fine for you to use your own convention: so long as you keep it consistent within your own API, easy to understand and easy to consume.

HATEOAS in the context of microservices

What is HATEOAS

Worked example: a retail product platform

Helpful resources