You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I’m having an issue with the QNL post harvest task that I noticed when trying to fix the traject config. I raised the question on stack overflow https://stackoverflow.com/questions/74293940/aggregating-multiple-data-types-in-pandas-groupby but no answer so far. The basic problem is a column might have multiple data types which cause problems when I try to merge rows together. e.g. the subject_name_namePart column has strings and lists. I can’t figure out how to fix this in pandas. Is there a way to do it before the data gets into a dataframe? For example, if a value is a string, can I force it into a list with one string value "string" > ["string"]?
I can't think of a reason why we would ever want to preserve mixed values in a Pandas column. I think we would always be okay if we automatically converted all strings to lists with one string value. We could then use parse_csv in traject on all fields.
Currently this is blocking me from transforming QNL data.
The text was updated successfully, but these errors were encountered:
I’m having an issue with the QNL post harvest task that I noticed when trying to fix the traject config. I raised the question on stack overflow https://stackoverflow.com/questions/74293940/aggregating-multiple-data-types-in-pandas-groupby but no answer so far. The basic problem is a column might have multiple data types which cause problems when I try to merge rows together. e.g. the subject_name_namePart column has strings and lists. I can’t figure out how to fix this in pandas. Is there a way to do it before the data gets into a dataframe? For example, if a value is a string, can I force it into a list with one string value
"string" > ["string"]
?I can't think of a reason why we would ever want to preserve mixed values in a Pandas column. I think we would always be okay if we automatically converted all strings to lists with one string value. We could then use
parse_csv
in traject on all fields.Currently this is blocking me from transforming QNL data.
The text was updated successfully, but these errors were encountered: