Building a Distributed Crawler using Akka

Rajesh Muppalla, Indix.

Rajesh Muppalla - rajesh@indix.com

About Me

About Indix

Data Pipeline @ Indix

Crawler - Requirements

Our Options

Our Choice - Akka

Why Akka?

So what's the secret sauce?

Actors

What is an Actor?

With a Diagram

Show me the code

Hello World

Parallelism

Supervision

Clustering

Back to the Crawler

Key Components

Patterns Used

Lessons Learned

Lessons Learned (Continued)

Akka Survey

Future Akka Roadmap

Resources

Questions

Thanks

Extras
