Spark – load CSV file as DataFrame?

spark-csv is part of core Spark functionality and doesn’t require a separate library.
So you could just do for example

df = spark.read.format("csv").option("header", "true").load("csvfile.csv")

In scala,(this works for any format-in delimiter mention “,” for csv, “\t” for tsv etc)

val df = sqlContext.read.format("com.databricks.spark.csv")
.option("delimiter", ",")
.load("csvfile.csv")

Leave a Comment