open source data collector for unified logging layer
Before Fluentd
After Fluentd
2,000+ data-driven companies
Pluggable Architecture
300+ community-contributed plugins
Built-in Reliability
Installation & Run
$ gem install fluentd
$ fluentd --setup ./fluent
$ fluentd -c ./fluent/fluent.conf
$ curl -L http://toolbelt.treasuredata.com/sh/install-redhat-td-agent2.sh | sh
$ /etc/init.d/td-agent start
$ /etc/init.d/td-agent stop
$ /etc/init.d/td-agent restart
$ /etc/init.d/td-agent status
td-agent
Configuration
<source>
type tail
format apache2
tag hdfs.apache.access
path /var/log/apache2/access.log
</source>
<match hdfs.*.*>
type webhdfs
host namenode.your.cluster.local
port 50070
path /log/%Y%m%d_%H/access.log.${hostname}
flush_interval 10s
field_separator TAB # or 'SPACE', 'COMMA'
output_data_type attr:path,status,referrer,agent
</match>
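The configuration above can be read as two steps: the tag `hdfs.apache.access` is matched against the pattern `hdfs.*.*`, and the `path` template is then expanded into a concrete HDFS file name. The following is a minimal Python sketch of that idea, not Fluentd's or the plugin's actual code. It assumes that `*` stands for exactly one tag part (the `**` form is not handled), that the time placeholders follow strftime rules, and that `${hostname}` is replaced by the sending host's name. The function names `tag_matches` and `expand_path` and the host name `web01` are hypothetical.

```python
from datetime import datetime


def tag_matches(pattern: str, tag: str) -> bool:
    """Return True if `tag` matches a simplified Fluentd-style pattern.

    Assumption: '*' matches exactly one non-empty tag part; every other
    pattern part must equal the tag part literally. '**' is not supported.
    """
    pattern_parts = pattern.split(".")
    tag_parts = tag.split(".")
    if len(pattern_parts) != len(tag_parts):
        return False
    return all(
        (p == "*" and t != "") or p == t
        for p, t in zip(pattern_parts, tag_parts)
    )


def expand_path(template: str, when: datetime, hostname: str) -> str:
    """Expand strftime placeholders and ${hostname} in a path template.

    Assumption: the template contains no '%' other than strftime directives.
    """
    return when.strftime(template).replace("${hostname}", hostname)


if __name__ == "__main__":
    print(tag_matches("hdfs.*.*", "hdfs.apache.access"))
    print(expand_path(
        "/log/%Y%m%d_%H/access.log.${hostname}",
        datetime(2015, 1, 2, 3, 4, 5),
        "web01",  # hypothetical host name
    ))
```

Under these assumptions each host writes one file per hour, so the hour in the directory name sets how finely the logs are partitioned on HDFS.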
Fluentd + HDFS
20,000 msgs/sec, 1.5 TB/day
http://www.slideshare.net/tagomoris/fluentd-and-webhdfs (NHN Japan)
$ fluent-gem install fluent-plugin-webhdfs
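As a rough sanity check on the figures quoted above, the two numbers can be combined into an implied average message size. This is only a back-of-the-envelope sketch. It assumes the 20,000 msgs/sec rate is sustained for the whole day and that 1.5 TB means decimal terabytes. If 20,000 msgs/sec is a peak rate, the real average message is larger.

```python
MSGS_PER_SEC = 20_000      # rate quoted on the slide
BYTES_PER_DAY = 1.5e12     # 1.5 TB/day, assuming decimal terabytes
SECONDS_PER_DAY = 86_400

# Messages per day if the quoted rate held around the clock (assumption).
msgs_per_day = MSGS_PER_SEC * SECONDS_PER_DAY

# Implied average size of one message in bytes.
avg_bytes_per_msg = BYTES_PER_DAY / msgs_per_day

if __name__ == "__main__":
    print(f"{msgs_per_day:,} messages/day")
    print(f"{avg_bytes_per_msg:.0f} bytes/message on average")
```

Under these assumptions the result is a little under 1 KB per message, which is plausible for an access-log line plus metadata.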
• C + Ruby
• 30-40 MB RAM
• 20,000 events/sec/core
• built-in monitoring
• 300+ Plugins
• Ruby / Java / Go / D / Perl / Python / PHP / node.js
• Internet of Things
• Active community
http://www.fluentd.org


Fluentd intro at Tajo seoul meetup