Simple learning project pushing CSV data into Kafka then indexing the data in ElasticSearch.
The indexes are created with a very rudimentary type discovery that uses simple regex patterns.
The Sacramento crime January 2006 dataset contains 7,584 crime records, as made available by the Sacramento Police Department.
- ElasticSearch, 1.8+
- Kafka, 2.10+
- Kibana, 4.1.1+
- Python 2.7+
- sudo pip install -r requirements.txt
- untar the distribution & cd kafka_2.10-0.8.2.1
- start zookeeper: bin/zookeeper-server-start.sh config/zookeeper.properties
- start kafka: bin/kafka-server-start.sh config/server.properties
- create a topic: bin/kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 1 --partitions 1 --topic test
See the Kafka QuickStart Guide