DEV Community

Cover image for 13 - Gin Handler Timeout Middleware
Jacob Goodwin
Jacob Goodwin

Posted on

13 - Gin Handler Timeout Middleware

We just added the ability to store refresh tokens in Redis. Today, let's take a break from the main application features to learn how to create our first Gin middleware!

If at any point you are confused about file structure or code, go to Github repository and check out the branch for the previous lesson to be in sync with me!

I also wanted to mentioned that I made two updates in the main branch of the repository for more easily getting the application running for newcomers to this project. The steps to run the project have been updated in the README, as well.

First, I decided to add the .env.dev file to the repository. For development, I don't plan to include any API secrets directly in this file. Later on, however, we'll reference a Google Cloud Configuration file in .env.dev, so you'll need to make sure to keep that configuration file secret as always.

Second, I've added a make rule, make init, which will start up our Postgres containers and run database migrations so that you'll be ready to start creating data (users for now) with the application!

If you prefer video, check out the video version below!

Handler Timeout Middleware

Here's the ol' diagram of what we'll be working on today.

Handler Timeout Middleware Diagram

If we look at the right-hand side of our middleware, we see our application layers. The handler layer methods all receive a *gin.Context. This context is provided to us by Gin and gives us access to numerous utility methods that make working with HTTP requests and responses easier.

The problem we're trying to solve is if a call to these handler methods (including all of the downstream calls to the service/repository layers and data sources) takes a long time, we want to be able to terminate the handler and send an error to the user.

We can do this with built-in functionality from the context package of Go, by wrapping the request context using the context.WithTimeout method. When we wrap our context with this method, we get access to the context.Done() method which "returns a channel that's closed when work done on behalf of this context should be canceled" (source). After using WithTimeout, the context will be cancelled for us automatically after the duration of time we pass to the method.

We'll save discussing the timeoutWriter for when we write the middleware code.

What About http.Server Timeout Fields?

If we look in ~/main.go, we initialize our server with Go's http.Server.



  srv := &http.Server{
        Addr:    ":8080",
        Handler: router,
  }


Enter fullscreen mode Exit fullscreen mode

You might be asking why we don't just make use of the net/http package's Server fields like ReadTimeout or WriteTimeout. These timeouts are concerned with the time to read and write the body to and from the client (i.e. networking I/O) and don't deal with the case of long running-handlers. There's a fine answer on StackOverflow you might want to read.

Middleware In Gin

Custom Middleware Function

Looking back at the diagram, remember that our goal is to manipulate the request context, which we extract from the gin.Context using the Request() method. I'll reiterate that the Gin and request context are two different contexts, which can be confusing.

To create a custom middleware in Gin, we create a function that returns a gin.HandlerFunc, which is defined as:



type HandlerFunc func(*Context)


Enter fullscreen mode Exit fullscreen mode

In middleware HandlerFunc, we can modify this gin.Context according to our needs. In our handler timeout middleware, we will wrap our the request context with a timeout and handle concurrency issues. Another simple example is a logger, as shown in the Gin docs. Yet another case you might see is extracting a user from a session or token, and setting this user as a key on gin.Context. In fact, we'll do just that in a handful of tutorials!

Usage in Gin

I recommend checking out the Using Middleware section of the Gin docs to see how you can call a middleware function for routes, groups, and individual routes! We'll end up applying middleware to our existing route group in ~/handler/handler.go with the Use method.



  // Create a group, or base url for all routes
    g := c.R.Group(c.BaseURL)

    if gin.Mode() != gin.TestMode {
        g.Use() // middleware will be called in Use()
    }


Enter fullscreen mode Exit fullscreen mode

Why Make Our Own Middleware?

I found the following open source middlewares for setting a handler timeout.

The first is the Go http.TimeoutHandler. The downside to this handler is that it's not configured to directly work with Gin routes and groups. Another thing I didn't like about the handler is that it is configured to send an HTML response in the case of a timeout, and doesn't appear to have a way to send a different Content-Type for a response.

The second middleware I found was gin-contrib/timeout. Note that this repository of middlewares is not the official one by gin-gonic, gin-gonic/contrib, even though it has a similar name. This middleware had issues trying to rewrite headers to a response writer, which we'll discuss when we write our middleware.

The third middleware I found was vearne/gin-timeout. This middleware actually addressed the above issues, and so I have no problem recommending it. However, it did not provide a way to modify the error response sent to the user.

Since getting a timeout handler middleware to work as I would like would require modifying any of the available handlers, I decided it would just be easier to create our own, even if I'm borrowing heavily from http.TimeoutHandler and vearne/gin-timeout.

Middleware Prep

Add A Pretend Delay to Signin Handler

In ~/handler/handler.go, let's fake a handler that takes a long time to complete. We'll use the Signin handler method as it currently doesn't yet have a "real" implementation. We'll set the Sleep duration to 6 seconds, as we'll end up setting our timeout limit to 5 seconds.



// Signin handler
func (h *Handler) Signin(c *gin.Context) {
    time.Sleep(6 * time.Second) // to demonstrate a timeout
    c.JSON(http.StatusOK, gin.H{
        "hello": "it's signin",
    })
}


Enter fullscreen mode Exit fullscreen mode

Update App Errors

I want to add a ServiceUnavailable (HTTP 503) to our ~/model/apperrors/apperrors.go to send when our handlers run longer than 5 seconds.



// update const
const (
    // ... other types excluded for brevity
    ServiceUnavailable   Type = "SERVICE_UNAVAILABLE"    // For long running handlers
)

// update status method switch case to handle ServiceUnavailable

func (e *Error) Status() int {
    switch e.Type {
    // other cases and default excluded for brevity
    case ServiceUnavailable:
        return http.StatusServiceUnavailable
    // cases omitted
    }
}

// add an error factory
// NewServiceUnavailable to create an error for 503
func NewServiceUnavailable() *Error {
    return &Error{
        Type:    ServiceUnavailable,
        Message: fmt.Sprintf("Service unavailable or timed out"),
    }
}


Enter fullscreen mode Exit fullscreen mode

Timeout Duration Environment Variable

Let's make timeout duration configurable. We'll set a handler timeout of 5 seconds for now. In reality, we could set different timeout lengths for different handlers or groups of handlers. As an example, you would probably expect the time to upload a file to a cloud service to take longer than a handler that merely updates text fields in the database. I encourage you to do this in your applications.

In .env.go add the following.



HANDLER_TIMEOUT=5


Enter fullscreen mode Exit fullscreen mode

We now need to update the instantiation of our handler via handler.Config to accept this field as a duration. In ~/handler/handler.go.



// Config will hold services that will eventually be injected into this
// handler layer on handler initialization
type Config struct {
    R               *gin.Engine
    UserService     model.UserService
    TokenService    model.TokenService
    BaseURL         string
    TimeoutDuration time.Duration
}


Enter fullscreen mode Exit fullscreen mode

Let's also load the environment variable and pass it to our handler config in ~/injection.go of package main.



  // read in HANDLER_TIMEOUT
    handlerTimeout := os.Getenv("HANDLER_TIMEOUT")
    ht, err := strconv.ParseInt(handlerTimeout, 0, 64)
    if err != nil {
        return nil, fmt.Errorf("could not parse HANDLER_TIMEOUT as int: %w", err)
    }

    handler.NewHandler(&handler.Config{
        R:               router,
        UserService:     userService,
        TokenService:    tokenService,
        BaseURL:         baseURL,
        TimeoutDuration: time.Duration(time.Duration(ht) * time.Second),
    })


Enter fullscreen mode Exit fullscreen mode

We now have a configurable handler timeout duration! Woot woot!

Timeout Middleware Code

After quite a preface, let's add the middleware code. Our middleware will be added to a sub-package of handler. Let's add a folder and file at the path ~/handler/middleware/timeout.go. We'll go step-by-step over the code we add to this middleware function.

Middleware Function Definition

Our middleware will accept a timeout duration, which will be received from handler.Config, and an error that we'll send to the user in case of a timeout. We'll end up passing the custom ServiceUnavailable error we recently created.



package middleware

// Imports omitted

func Timeout(timeout time.Duration, errTimeout *apperrors.Error) gin.HandlerFunc {
    return func(c *gin.Context) {
    // Code will go here
  }
}


Enter fullscreen mode Exit fullscreen mode

Set Gin Writer to Custom Timeout Writer

We first set the writer on Gin's context, c, to a custom writer, tw. You'll also observe that this writer is passed as a field to our custom timeoutWriter. This is a little confusing, but will hopefully become clearer after we go over the timeoutWriter definition.



    // set Gin's writer as our custom writer
        tw := &timeoutWriter{ResponseWriter: c.Writer, h: make(http.Header)}
        c.Writer = tw


Enter fullscreen mode Exit fullscreen mode

Wrap Context in Timeout

In order to apply a timeout duration, we can use context.WithTimeout as follows. We defer the cancel function's execution until the end of the function. When cancel is called, all resources associated with it context, ctx, will be freed up.



    // wrap the request context with a timeout
        ctx, cancel := context.WithTimeout(c.Request.Context(), timeout)
        defer cancel()

        // update gin request context
        c.Request = c.Request.WithContext(ctx)


Enter fullscreen mode Exit fullscreen mode

Channels For Finished of Panic "Signals"

We create channels that we'll send values into if our handler successfully completes or if the application panics.



    finished := make(chan struct{})        // to indicate handler finished
        panicChan := make(chan interface{}, 1) // used to handle panics if we can't recover


Enter fullscreen mode Exit fullscreen mode

Call Handler Inside of a GoRoutine

We then create and invoke a goroutine which calls c.Next(). Next is a helper method which calls the next middleware or handler function (you can "cascade" handler functions and middlewares). As of yet, we only have a single middleware, so c.Next() will call a handler method. If the method completes, we pass an empty struct into the finished channel.

We also defer a call to recover from a panic and send the captured value from the panic and send it to the panicChan (it turns out we don't do anything with this value, so send and empty struct, if you prefer).



    go func() {
            defer func() {
                if p := recover(); p != nil {
                    panicChan <- p
                }
            }()

            c.Next() // calls subsequent middleware(s) and handler
            finished <- struct{}{}
        }()


Enter fullscreen mode Exit fullscreen mode

Handling Either Finished, Panic, or Timeout/context.Done()

After invoking the goroutine, we then create a select statement that listens for incoming values on one of three channels. In all of these cases, we then write a header and body to a response writer embedded inside of our custom timeoutWriter in variable tw.

This may be a little confusing, writing to a Writer inside of a Writer, but this is how we prevent overwriting response bodies and headers. We basically use the timeoutWriter as a temporary/proxy writer that also stores the state of whether or not we have either timed out or already sent the response from a handler.



    select {
        case <-panicChan:
            // if we cannot recover from panic,
            // send internal server error
            e := apperrors.NewInternal()
            tw.ResponseWriter.WriteHeader(e.Status())
            eResp, _ := json.Marshal(gin.H{
                "error": e,
            })
            tw.ResponseWriter.Write(eResp)
        case <-finished:
            // if finished, set headers and write resp
            tw.mu.Lock()
            defer tw.mu.Unlock()
            // map Headers from tw.Header() (written to by gin)
            // to tw.ResponseWriter for response
            dst := tw.ResponseWriter.Header()
            for k, vv := range tw.Header() {
                dst[k] = vv
            }
            tw.ResponseWriter.WriteHeader(tw.code)
            // tw.wbuf will have been written to already when gin writes to tw.Write()
            tw.ResponseWriter.Write(tw.wbuf.Bytes())
        case <-ctx.Done():
            // timeout has occurred, send errTimeout and write headers
            tw.mu.Lock()
            defer tw.mu.Unlock()
            // ResponseWriter from gin
            tw.ResponseWriter.Header().Set("Content-Type", "application/json")
            tw.ResponseWriter.WriteHeader(errTimeout.Status())
            eResp, _ := json.Marshal(gin.H{
                "error": errTimeout,
            })
            tw.ResponseWriter.Write(eResp)
            c.Abort()
            tw.SetTimedOut()
        }


Enter fullscreen mode Exit fullscreen mode

In the case of a panic, we simply write an internal server error with the appropriate status code.

If we receive on the finished channel, this means that our custom timeoutWriter will have the headers written by Gin to its h field, which will store headers. We then copy these headers to the inner, gin.ResponseWriter, referenced by dst. We then send the status code from tw, and the actual response body from the handler which is stored in tw.wbuf. We make sure to lock tw using it's mutex field to prevent writes in the case we "simultaneously" receive a on another channel.

In the case when the context times out, we receive on the channel returned by ctx.Done(). We also create a response with the *apperrors.Error passed to the middleware as a parameter. We then call c.Abort() which prevents any subsequent handlers or middlewares from being called, and set tw's timedOut field to true so that if the handler response eventually comes through, it will not be written to the client.

Let's now discuss the timeoutWriter in detail.

Custom Timeout Writer Explanation

The code for timeoutWriter is included below.



// implements http.Writer, but tracks if Writer has timed out
// or has already written its header to prevent
// header and body overwrites
// also locks access to this writer to prevent race conditions
// holds the gin.ResponseWriter which we'll manually call Write()
// on in the middleware function to send response
type timeoutWriter struct {
    gin.ResponseWriter
    h    http.Header
    wbuf bytes.Buffer // The zero value for Buffer is an empty buffer ready to use.

    mu          sync.Mutex
    timedOut    bool
    wroteHeader bool
    code        int
}

// Writes the response, but first makes sure there
// hasn't already been a timeout
// In http.ResponseWriter interface
func (tw *timeoutWriter) Write(b []byte) (int, error) {
    tw.mu.Lock()
    defer tw.mu.Unlock()
    if tw.timedOut {
        return 0, nil
    }

    return tw.wbuf.Write(b)
}

// In http.ResponseWriter interface
func (tw *timeoutWriter) WriteHeader(code int) {
    checkWriteHeaderCode(code)
    tw.mu.Lock()
    defer tw.mu.Unlock()
    // We do not write the header if we've timed out or written the header
    if tw.timedOut || tw.wroteHeader {
        return
    }
    tw.writeHeader(code)
}

// set that the header has been written
func (tw *timeoutWriter) writeHeader(code int) {
    tw.wroteHeader = true
    tw.code = code
}

// Header "relays" the header, h, set in struct
// In http.ResponseWriter interface
func (tw *timeoutWriter) Header() http.Header {
    return tw.h
}

// SetTimeOut sets timedOut field to true
func (tw *timeoutWriter) SetTimedOut() {
    tw.timedOut = true
}

func checkWriteHeaderCode(code int) {
    if code < 100 || code > 999 {
        panic(fmt.Sprintf("invalid WriteHeader code %v", code))
    }
}


Enter fullscreen mode Exit fullscreen mode

This writer implements an http.ResponseWriter with the required Write, WriteHeader, and Header methods. In the middleware, we replaced the writer on *gin.Context with the custom timeoutWriter. Therefore, when the Gin handlers write a response, they will end up calling timeoutWriter.Write, timeoutWriter.WriterHeader and timeoutWriter.Header.

Let's take a look at what would happen if we use this middleware and send a response from the Signin handler.



  c.JSON(http.StatusOK, gin.H{
        "hello": "it's signin",
  })


Enter fullscreen mode Exit fullscreen mode

When Gin calls the JSON method it will set the response body by calling timeoutWriter.Write with the JSON body inside of the gin.H map. If we look at timeoutWriter.Write, you can see that it checks that we haven't already set the timedOut field. This is how we prevent overwriting our response body! If we haven't timed out, we write the body to our custom field, wbuf. This wbuf temporarily stores the valid response which gets "relayed" to the client in the finished channel branch of the switch case.

The c.JSON method will also call timeoutWriter.Header() to get access to our writer's headers, timeoutWriter.h. Gin then sets headers such as Content-Type and Content-Length.

Finally, c.JSON will also set the HTTP status via the timeoutWriter.WriteHeader() method. Our custom WriteHeader() method will check to see if the header has already been written or if we've already timed out via the fields timedOut and wroteHeader. If we've already timed out or written a header, we do nothing. Otherwise, we set wroteHeader and the HTTP status code to our struct.

We also have a mu sync.Mutex in our custom writer. You can see that anytime we write to our timeoutWriter, we call tw.mu.Lock(). When we add a sync.Mutex to a struct, this allows us to lock struct to updates by concurrent routines which may want to access our timeoutWriter data (sorry if my terminology is off). Once all of the updates to our timeoutWriter data have been made, we Unlock access to the fields/data.

Apply Middleware

In ~/handler/handler.go we'll use our middleware immediately after creating our route group.



// Create a group, or base url for all routes
    g := c.R.Group(c.BaseURL)

    if gin.Mode() != gin.TestMode {
        g.Use(middleware.Timeout(c.TimeoutDuration, apperrors.NewServiceUnavailable()))
    }


Enter fullscreen mode Exit fullscreen mode

You should now be able to run your app with docker-compose up and send a request to the "signin" endpoint as follows (with Curl or other client):



➜ curl --location --request POST 'http://malcorp.test/api/account/signin' \
--header 'Authorization: Bearer {{idToken}}' \
--header 'Content-Type: application/json' \
--data-raw '{}'
{"error":{"type":"SERVICE_UNAVAILABLE","message":"Service unavailable or timed out process"}}% 


Enter fullscreen mode Exit fullscreen mode

If you then set the timeout in Signin to something less than 5 seconds (our application's handler timeout duration), you should receive a status 200 response with JSON.



// Signin handler
func (h *Handler) Signin(c *gin.Context) {
    time.Sleep(1 * time.Second) // to demonstrate normal use
    c.JSON(http.StatusOK, gin.H{
        "hello": "it's signin",
    })
}


Enter fullscreen mode Exit fullscreen mode

Rerunning the same curl command we receive:



{"hello":"it's signin"}%

Enter fullscreen mode Exit fullscreen mode




Conclusion

That was actually a relatively complex middleware. If you're like me when I was learning from other's work, you may have to stare at it for a while until it makes sense. Yet I hope I've laid it out in a manner that saves you from a few minutes of confusion! Heck, I'm not sure I did everything right, so let me know if you think there are any blaring mistakes!

Next time, we'll write and test the Signin handler. Following that, we'll add service and repository method implementations to complete the full signin functionality.

¡Hasta pronto!

Top comments (4)

Collapse
 
terashin777 profile image
terashin777 • Edited

Thank you for your great article.

I have a question.
Why you lock tw using it's mutex field?

I think select is evaluated once.
For example, if finish and ctx.done are received simultaneously, one of which is processed.
And channels except finished don't read tw.wbuf.
So I think even if main thread write something to tw.wbuf, panicChan and ctx.Done() select process are not affected.

Collapse
 
ivanovaleksey profile image
Aleksey Ivanov

I don't get it either

Collapse
 
dimgsg9 profile image
Dima G

Sorry for a late reply :)

I've tested the response time with and without "timeout" middleware and didn't notice any additional latency introduced by the middleware.
95th percentile for /account/signup with & without the middleware was ~97ms

Also tested with and without traefik:
95th percentile for /account/signup with & without the traefik was ~97ms

No concurrent requests tested, just 20 sequential HTTP requests to /signup endpoint

Environment: docker inside Vagrant VM, client: Postman on host machine

Would be interesting to see the timeout middleware impact with a proper load test...

Collapse
 
whuangz profile image
William

is it only me that think by implementing timeout middleware, make the response slower than usual?

I tried to implement your timeout code in my project
usually when I fetch 100 data from my repo to JSON it' around 200 - 500 ms

but by implementing the timeout it could reach 4-5 s