Parsing multiline records in Scala
By default Spark creates a single element per line. It means that in your case every record is spread over multiple elements which, as stated by Daniel Darabos in the comments, can be processed by different workers. Since it looks like your data is relatively regular and separated by an empty line you should be … Read more