How to import a text file on AWS S3 into pandas without writing to disk

pandas uses boto for read_csv, so you should be able to:

import boto
data = pd.read_csv('s3://bucket....csv')

If you need boto3 because you are on python3.4+, you can

import boto3
import io
s3 = boto3.client('s3')
obj = s3.get_object(Bucket="bucket", Key='key')
df = pd.read_csv(io.BytesIO(obj['Body'].read()))

Since version 0.20.1 pandas uses s3fs, see answer below.

More Related Contents:

How to write a file or data to an S3 object using boto3
Boto3 to download all files from a S3 Bucket
Save Dataframe to csv directly to s3 Python
Read file content from S3 bucket with boto3
check if a key exists in a bucket in s3 using boto3
Error “Read-only file system” in AWS Lambda when downloading a file from S3
Listing contents of a bucket with boto3
Retrieving subfolders names in S3 bucket from boto3
How to read a list of parquet files from S3 as a pandas dataframe using pyarrow?
How to upload File in FastAPI, then to Amazon S3 and finally process it?
Can I use boto3 anonymously?
Pandas in AWS lambda gives numpy error
Getting S3 objects’ last modified datetimes with boto
How to specify credentials when connecting to boto3 S3?
Open S3 object as a string with Boto3
how to copy s3 object from one bucket to another using python boto3
How to return a PDF file from in-memory buffer using FastAPI?
Reading an JSON file from S3 using Python boto3
Download a folder from S3 using Boto3
how to calculation cost time [closed]
Reshape wide to long in pandas
How do convert a pandas dataframe to XML?
How to add hovering annotations to a plot
Converting between datetime and Pandas Timestamp objects
Shift NaNs to the end of their respective rows
multi index plotting
Pandas how can ‘replace’ work after ‘loc’?
Pandas Dataframe: split column into multiple columns, right-align inconsistent cell entries
Pandas: Reading Excel with merged cells
Creating multiple Excel worksheets using data from a pandas DataFrame

More Related Contents:

Leave a Comment Cancel reply