As a beginner in big data, I started by learning log collection (mainly Flume) and looked up the common log collection architectures. Where Flume is involved, most of them recommend Flume + Kafka. As a beginner, I don't quite understand this: one is a distributed log collection system and the other is a message queue (MQ), so why are they used together? I would be grateful if the experts here could clear this up for me.
In addition, once the logs are collected they need to be analyzed and processed later, so which technologies are generally recommended for that analysis?