Investigations on Spark Join

by Sachin Tyagi

(Note - this post is extracted from an email on the topic and has been posted on labs.imaginea.com retroactively.)

TLDR: There are things you can do to improve your RDD join performance on Spark in some cases.