A dataset is a data set of specific object.
To define this domain Specific Object, an encoder is required.
val people = spark.read.parquet("...").as[Person]
Dataset<Person> people = spark.read().parquet("...").as(Encoders.bean(Person.class));
To understand the internal binary representation for data, use the schema function.