Streaming CSV/JSON Object Store Read #2935
Labels: enhancement, good first issue, help wanted
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Currently `CsvOpener` and `JsonOpener` call `GetResult::bytes`, which downloads the entire file before feeding it to the appropriate arrow reader. This is not ideal, as no decoding can start until the whole file has been downloaded.
Following on from #2677, we now support streaming responses from object storage.
Describe the solution you'd like
The underlying challenge is to take an arbitrary `Stream<Bytes>` and convert it into a `Stream<Bytes>` where each stream element contains only complete rows, as delimited by a newline character. Once we have this `DelimitedStream`, it is trivial to feed each of these byte chunks individually into the corresponding decoder.
Describe alternatives you've considered
We could not do this
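To make the proposed solution more concrete, here is a minimal, std-only sketch of the re-chunking step. It is an illustration under assumptions, not the intended implementation: `LineDelimiter` and `delimited` are hypothetical names, a plain iterator of byte slices stands in for the asynchronous `Stream<Bytes>`, and newlines inside quoted CSV fields are not handled.

```rust
/// Hypothetical helper: re-chunks arbitrary byte chunks so that every
/// emitted chunk ends on a newline boundary.
#[derive(Default)]
struct LineDelimiter {
    /// Bytes of a trailing row that has not been terminated yet.
    remainder: Vec<u8>,
}

impl LineDelimiter {
    /// Accept the next input chunk. Returns the buffered bytes plus all
    /// complete rows of this chunk, or `None` if no row was completed.
    fn push(&mut self, chunk: &[u8]) -> Option<Vec<u8>> {
        match chunk.iter().rposition(|&b| b == b'\n') {
            // No newline in this chunk: the row is incomplete, keep buffering.
            None => {
                self.remainder.extend_from_slice(chunk);
                None
            }
            // Split just after the last newline; carry the tail over.
            Some(idx) => {
                let (complete, rest) = chunk.split_at(idx + 1);
                let mut out = std::mem::take(&mut self.remainder);
                out.extend_from_slice(complete);
                self.remainder.extend_from_slice(rest);
                Some(out)
            }
        }
    }

    /// Flush a final row that had no trailing newline.
    fn finish(&mut self) -> Option<Vec<u8>> {
        if self.remainder.is_empty() {
            None
        } else {
            Some(std::mem::take(&mut self.remainder))
        }
    }
}

/// Re-chunk an iterator of byte chunks (an assumption standing in for
/// `Stream<Bytes>`) into chunks that contain only complete rows.
fn delimited<'a, I>(chunks: I) -> Vec<Vec<u8>>
where
    I: IntoIterator<Item = &'a [u8]>,
{
    let mut delimiter = LineDelimiter::default();
    let mut out: Vec<Vec<u8>> = chunks
        .into_iter()
        .filter_map(|chunk| delimiter.push(chunk))
        .collect();
    out.extend(delimiter.finish());
    out
}
```

For example, the input chunks `"a,b\n1,"` and `"2\n3,4"` would be re-emitted as `"a,b\n"`, `"1,2\n"` and `"3,4"`, each of which can be handed to a decoder on its own. A real version would presumably wrap the same state machine around a `Stream` rather than an iterator.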