Issue
i am trying to save my spark dataframe in tfrecords format , but it's give error.
here is my command ,
df.write.format("tfrecords").mode("overwrite").save()
error message :
Py4JJavaError: An error occurred while calling o1095.save.
: java.lang.ClassNotFoundException: Failed to find data source: tfrecords. Please find packages at http://spark.apache.org/third-party-projects.html
spark version 2.4.5
Solution
You need to make sure that you run your Spark Job with the spark-tfrecord
JAR . The GitHub repository provides some instructions on how to do that.
Answered By - Robert Kossendey
0 comments:
Post a Comment
Note: Only a member of this blog may post a comment.