Spark Streaming for beginners

Whether you are running an eCommerce store and want ¬†to put up a dash board which shows the number of ¬†orders processed every minute or run a very popular blog and would like to display trending articles on your web site or any other scenarios like this, all of these…

PySpark tips for beginners

Be careful when you use .collect()Do not call .collect() on RDD or data frame. Your driver may go out of memory if RDD or data frame is too large to fit on a node. Use take() function instead. You can specify the count with take that reduces the number…