API CHANGE: add `to_dataframe` to each table #126

danield137 · 2019-04-02T10:04:34Z

After investing work in #124,
and some internal discussions, we agreed to wait with this PR and reconsider changing the API to give better performance for both vanilla python and pandas use cases, and save some difficult trickery to allow parsing kusto type to dataframe:

Final api would look like

# result is of type KustoResultDataSet
result = client.execute(db, query)
# raw json 
result.tables[0].json()
# iterator with lazy parsing of json
result.tables[0].rows()
# dataframe parsing from raw json
result.tables[0].to_dataframe()

This will cause some memory pressure, so a best practice would probably be:

# either explicitly access a specific table and drop the reference after conversion
df = client.execute(db, query).primary_results[0].to_dataframe()
# or, parse it all
dfs = client.execute(db, query).to_dataframes()

Feel free to add your thoughts, code will be implemented in next couple of weeks.

danield137 · 2019-04-02T12:26:38Z

#127

vladikbr · 2020-08-05T14:40:27Z

Postponed

danield137 added the Discussion label Apr 2, 2019

danield137 assigned toshetah Apr 2, 2019

vladikbr unassigned toshetah Aug 5, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API CHANGE: add `to_dataframe` to each table #126

API CHANGE: add `to_dataframe` to each table #126

danield137 commented Apr 2, 2019 •

edited

Loading

danield137 commented Apr 2, 2019

vladikbr commented Aug 5, 2020

API CHANGE: add to_dataframe to each table #126

API CHANGE: add to_dataframe to each table #126

Comments

danield137 commented Apr 2, 2019 • edited Loading

danield137 commented Apr 2, 2019

vladikbr commented Aug 5, 2020

API CHANGE: add `to_dataframe` to each table #126

API CHANGE: add `to_dataframe` to each table #126

danield137 commented Apr 2, 2019 •

edited

Loading