How to access s3a:// files from Apache Spark?
Having experienced first hand the difference between s3a and s3n – 7.9GB of data transferred on s3a was around ~7 minutes while 7.9GB of data on s3n took 73 minutes [us-east-1 to us-west-1 unfortunately in both cases; Redshift and Lambda being us-east-1 at this time] this is a very important piece of the stack to … Read more