2017-11-20 16:21:40 +00:00
# Golang Kinesis Consumer
2014-07-25 06:03:41 +00:00
2017-11-20 19:14:39 +00:00
Kinesis consumer applications written in Go. This library is intended to be a lightweight wrapper around the Kinesis API to read records, save checkpoints (with swappable backends), and gracefully recover from network errors.
2017-11-20 17:37:30 +00:00
2017-11-20 17:45:57 +00:00
_NOTE: With the release of [Kinesis to Firehose ](http://docs.aws.amazon.com/firehose/latest/dev/writing-with-kinesis-streams.html ) it's possible to archive data directly to S3, Redshift, or Elasticsearch without running a consumer application._
2016-02-03 05:04:22 +00:00
2017-11-20 18:29:30 +00:00
_UPDATE: To avoid managing checkpoints and running any infrastructure it's also possible to [Process Kinensis Streams with Golang and Lambda ](https://medium.com/@harlow/processing-kinesis-streams-w-aws-lambda-and-golang-264efc8f979a )._
2017-11-20 16:21:40 +00:00
## Installation
2015-05-23 15:52:08 +00:00
2017-11-20 16:21:40 +00:00
Get the package source:
$ go get github.com/harlow/kinesis-consumer
2015-05-23 23:18:10 +00:00
2014-07-25 06:03:41 +00:00
## Overview
2014-07-25 06:03:41 +00:00
2017-11-20 16:21:40 +00:00
The consumer leverages a handler func that accepts a Kinesis record. The `Scan` method will consume all shards concurrently and call the callback func as it receives records from the stream.
2016-02-03 05:04:22 +00:00
2016-02-09 03:42:26 +00:00
```go
2017-11-20 16:21:40 +00:00
import consumer "github.com/harlow/kinesis-consumer"
2016-02-03 05:04:22 +00:00
func main() {
2017-11-20 16:21:40 +00:00
log.SetHandler(text.New(os.Stderr))
log.SetLevel(log.DebugLevel)
var (
app = flag.String("app", "", "App name") // name of consumer group
stream = flag.String("stream", "", "Stream name")
)
flag.Parse()
c, err := consumer.New(*app, *stream)
if err != nil {
log.Fatalf("new consumer error: %v", err)
}
c.Scan(context.TODO(), func(r *kinesis.Record) bool {
fmt.Println(string(r.Data))
return true // continue scanning
})
2016-02-03 05:04:22 +00:00
}
```
2014-07-25 06:03:41 +00:00
2017-11-20 16:21:40 +00:00
Note: If you need to aggregate based on a specific shard the `ScanShard` method should be leverged instead.
### Configuration
2016-12-04 08:08:06 +00:00
2017-11-20 16:21:40 +00:00
The consumer requires the following config:
2016-12-04 08:08:06 +00:00
2017-11-20 16:21:40 +00:00
* App Name (used for checkpoints)
* Stream Name (kinesis stream name)
It also accepts the following optional overrides:
* Kinesis Client
2017-11-20 19:14:39 +00:00
* Checkpoint Storage
2017-11-20 16:21:40 +00:00
* Logger
```go
2017-11-20 17:37:30 +00:00
// new kinesis client
2017-11-20 16:21:40 +00:00
svc := kinesis.New(session.New(aws.NewConfig()))
2017-11-20 17:37:30 +00:00
// new consumer with custom client
2017-11-20 16:21:40 +00:00
c, err := consumer.New(
appName,
streamName,
consumer.WithClient(svc),
)
2016-12-04 08:08:06 +00:00
```
2017-11-20 16:21:40 +00:00
2017-11-20 19:14:39 +00:00
### Checkpoint Storage
2017-11-20 16:21:40 +00:00
2017-11-20 19:06:46 +00:00
To record the progress of the consumer in the stream we store the last sequence number the consumer has read from a particular shard. This will allow consumers to re-launch and pick up at the position in the stream where they left off.
2017-11-20 19:14:39 +00:00
< img width = "687" alt = "kinesis-checkpoints" src = "https://user-images.githubusercontent.com/739782/33036582-b6f3c4b4-cde3-11e7-9334-c4bfbe34d984.png" >
2017-11-20 19:06:46 +00:00
2017-11-20 16:21:40 +00:00
The default checkpoint uses Redis on localhost; to set a custom Redis URL use ENV vars:
```
2017-11-20 17:55:43 +00:00
REDIS_URL=redis.yoursite.com:6379
2016-12-04 08:08:06 +00:00
```
2017-11-20 17:37:30 +00:00
To leverage DynamoDB as the backend for checkpoint we'll need a new table:
2017-11-20 17:55:43 +00:00
< img width = "659" alt = "screen shot 2017-11-20 at 9 16 14 am" src = "https://user-images.githubusercontent.com/739782/33033316-db85f848-cdd8-11e7-941a-0a87d8ace479.png" >
2017-11-20 17:37:30 +00:00
Then override the checkpoint config option:
```go
2017-11-20 17:55:43 +00:00
// ddb checkpoint
ck, err := checkpoint.New(tableName, appName, streamName)
2017-11-20 17:37:30 +00:00
if err != nil {
log.Fatalf("new checkpoint error: %v", err)
}
2017-11-20 17:55:43 +00:00
// consumer with checkpoint
2017-11-20 17:37:30 +00:00
c, err := consumer.New(
appName,
streamName,
consumer.WithCheckpoint(ck),
)
```
2017-11-20 16:21:40 +00:00
2016-05-01 19:20:44 +00:00
### Logging
2016-05-08 01:15:55 +00:00
[Apex Log ](https://medium.com/@tjholowaychuk/apex-log-e8d9627f4a9a#.5x1uo1767 ) is used for logging Info. Override the logs format with other [Log Handlers ](https://github.com/apex/log/tree/master/_examples ). For example using the "json" log handler:
2016-05-01 19:20:44 +00:00
```go
import(
2016-05-08 01:15:55 +00:00
"github.com/apex/log"
"github.com/apex/log/handlers/json"
2016-05-01 19:20:44 +00:00
)
func main() {
// ...
2016-05-08 01:15:55 +00:00
log.SetHandler(json.New(os.Stderr))
log.SetLevel(log.DebugLevel)
2016-05-01 19:20:44 +00:00
}
```
2016-05-01 19:40:30 +00:00
Which will producde the following logs:
```
INFO[0000] processing app=test shard=shardId-000000000000 stream=test
2017-11-20 16:21:40 +00:00
INFO[0008] checkpoint app=test shard=shardId-000000000000 stream=test
INFO[0012] checkpoint app=test shard=shardId-000000000000 stream=test
2016-05-01 19:40:30 +00:00
```
2015-05-23 23:18:10 +00:00
## Contributing
2015-05-23 15:52:08 +00:00
2015-05-23 22:22:58 +00:00
Please see [CONTRIBUTING.md] for more information. Thank you, [contributors]!
2015-05-23 15:52:08 +00:00
[LICENSE]: /MIT-LICENSE
[CONTRIBUTING.md]: /CONTRIBUTING.md
2015-05-23 23:18:10 +00:00
## License
2015-05-23 15:52:08 +00:00
Copyright (c) 2015 Harlow Ward. It is free software, and may
be redistributed under the terms specified in the [LICENSE] file.
[contributors]: https://github.com/harlow/kinesis-connectors/graphs/contributors
2016-05-01 19:45:27 +00:00
> [www.hward.com](http://www.hward.com) ·
> GitHub [@harlow](https://github.com/harlow) ·
> Twitter [@harlow_ward](https://twitter.com/harlow_ward)