Load WARC Files into Apache Spark


[Up] [Top]

Documentation for package ‘sparkwarc’ version 0.1.1

Help Pages

cc_warc Provides WARC paths for commoncrawl.org
spark_read_warc Reads a WARC File into Apache Spark