A dataset comprising lines from one or more text files.
text_line_dataset(
filenames,
compression_type = NULL,
...,
buffer_size = NULL,
num_parallel_reads = NULL,
name = NULL,
record_spec = NULL,
parallel_records = NULL
)
A dataset
String(s) specifying one or more filenames
A string, one of: NULL
(no compression), "ZLIB"
,
or "GZIP"
.
unused, must be empty.
(Optional.) A tf.int64 scalar denoting the number of bytes to buffer. A value of 0 results in the default buffering values chosen based on the compression type.
(Optional.) A tf.int64 scalar representing the number of files to read in parallel. If greater than one, the records of files read in parallel are outputted in an interleaved order. If your input pipeline is I/O bottlenecked, consider setting this parameter to a value greater than one to parallelize the I/O. If NULL, files will be read sequentially.
(Optional.) A name for the tf.data operation.
(Optional) Specification used to decode delimimted text
lines into records (see delim_record_spec()
).
(Optional) An integer, representing the number of
records to decode in parallel. If not specified, records will be processed
sequentially. This is only applicable if record_spec
is provided