HttpReader
- class torchdata.datapipes.iter.HttpReader(source_datapipe: IterDataPipe[str], timeout: Optional[float] = None)
Takes file URLs (HTTP URLs pointing to files) and yields tuples of the file URL and an IO stream.
- Parameters
source_datapipe – a DataPipe that yields the URLs to read
timeout – timeout, in seconds, for the HTTP request
Example
>>> from torchdata.datapipes.iter import IterableWrapper, HttpReader
>>> file_url = "https://raw.githubusercontent.com/pytorch/data/main/LICENSE"
>>> http_reader_dp = HttpReader(IterableWrapper([file_url]))
>>> reader_dp = http_reader_dp.readlines()
>>> it = iter(reader_dp)
>>> path, line = next(it)
>>> path
https://raw.githubusercontent.com/pytorch/data/main/LICENSE
>>> line
b'BSD 3-Clause License'
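The contract described above (URLs in, tuples of URL and stream out, followed by a line-by-line read) can be illustrated without torchdata. The block below is a minimal sketch using only the standard library; it is not how HttpReader is implemented. The names `iter_http_streams` and `iter_lines` are hypothetical, and stripping the trailing newline is an assumption chosen to match the `b'BSD 3-Clause License'` output shown in the example.

```python
from typing import BinaryIO, Iterable, Iterator, Optional, Tuple
from urllib.request import urlopen


def iter_http_streams(
    urls: Iterable[str], timeout: Optional[float] = None
) -> Iterator[Tuple[str, BinaryIO]]:
    """Hypothetical stand-in for HttpReader: yield (url, binary stream) pairs."""
    for url in urls:
        # Only pass a timeout when one was given, so that None keeps
        # urllib's default socket timeout behaviour.
        kwargs = {} if timeout is None else {"timeout": timeout}
        yield url, urlopen(url, **kwargs)


def iter_lines(
    pairs: Iterable[Tuple[str, BinaryIO]]
) -> Iterator[Tuple[str, bytes]]:
    """Hypothetical stand-in for .readlines(): yield (url, line) pairs."""
    for url, stream in pairs:
        # Close each stream once all of its lines have been consumed.
        with stream:
            for raw_line in stream:
                # Assumption: drop the line terminator, as in the example output.
                yield url, raw_line.rstrip(b"\r\n")


# Usage (performs a real HTTP request, so it is left as a comment):
#   file_url = "https://raw.githubusercontent.com/pytorch/data/main/LICENSE"
#   path, line = next(iter_lines(iter_http_streams([file_url], timeout=10.0)))
```

Both helpers are generators, so no request is sent until the result is iterated, which mirrors the lazy, pull-based style of an IterDataPipe.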