kafka

This is an old revision of the document!


Apache Kafka is an open-source stream-processing software platform developed by the Apache Software Foundation, written in Scala and Java. The project aims to provide a unified, high-throughput, low-latency platform for handling real-time data feeds.

In a nutshell, you say data to it and you can do something with that data. Here we will process data from Oracle through Kafka to big data.

So for example, you can have: Oracle → Kafka → Hadoop/Hbase

To start Kafka, we can use the following script:

Start Kafka

[oracle@edvmr1p0 ~]$ kafka-server-start.sh
USAGE: /opt/kafka/bin/kafka-server-start.sh [-daemon] server.properties [--override property=value]*
[oracle@edvmr1p0 ~]$ kafka-server-start.sh $KAFKA_HOME/config/server.properties
[2020-11-11 09:33:54,319] INFO KafkaConfig values: 
	advertised.host.name = null
	metric.reporters = []
	quota.producer.default = 9223372036854775807
	offsets.topic.num.partitions = 50
	log.flush.interval.messages = 9223372036854775807
	auto.create.topics.enable = true
	controller.socket.timeout.ms = 30000
	log.flush.interval.ms = null
	principal.builder.class = class org.apache.kafka.common.security.auth.DefaultPrincipalBuilder
	replica.socket.receive.buffer.bytes = 65536
	min.insync.replicas = 1
.........
Create a topic as follows

Create Topic

[oracle@edvmr1p0 config]$ kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 1 --partitions 1 -topic ogg12cBigData
Created topic "ogg12cBigData".
Send message to Kafka

Send message to Kafka

[oracle@edvmr1p0 config]$ kafka-console-producer.sh --broker-list localhost:9092 --topic ogg12cBigData
Testing Kafka in the context of OGG 12c for Big Data                  <- First Message

Second message sent to the Kafka cluster                              <- Second Message
  • kafka.1605088064.txt.gz
  • Last modified: 2020/11/11 09:47
  • by andonovj