external_author_id and author relationship #32

Open
EvanCarroll opened this issue Aug 28, 2018 · 3 comments

Comments

@EvanCarroll

This field is confusing to me: some external_author_id values map to multiple authors.

SELECT DISTINCT external_author_id, author
FROM rustweets.tweets
WHERE external_author_id = 753000000000000000;
 external_author_id |     author      
--------------------+-----------------
 753000000000000000 | ANGELABACH991
 753000000000000000 | ANGELA_LATTKE
 753000000000000000 | BECKRALFBECK265
 753000000000000000 | CHRISTINAPOOL61
 753000000000000000 | DARRELL_H_HUNT
 753000000000000000 | DOMINIKKELLER22
 753000000000000000 | EHERMANN66
 753000000000000000 | ERIKADIXONLOVE
 753000000000000000 | JOACHIMBUCHWITZ
 753000000000000000 | LARSWOLFLARS
 753000000000000000 | LGBTUNI
 753000000000000000 | LUISSTOCKBERG
 753000000000000000 | MALTE_ROSS
 753000000000000000 | MANUELKROSSS
 753000000000000000 | MARGARETHKURZ
 753000000000000000 | MARMARSCH1
 753000000000000000 | PETERSCHULZ541
(17 rows)
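
To see how widespread this is beyond the single id above, a rough sketch along these lines might help. It assumes only the same rustweets.tweets table and columns used in the query above; the n_authors alias is just an illustrative name.

```sql
-- Count the distinct authors attached to each external_author_id and
-- keep only the ids that are shared by more than one author.
SELECT external_author_id,
       count(DISTINCT author) AS n_authors
FROM rustweets.tweets
GROUP BY external_author_id
HAVING count(DISTINCT author) > 1
ORDER BY n_authors DESC;
```

If this returns only a handful of ids, the problem may be limited to a few bad values; if it returns many, the column probably cannot be used as a key at all.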

@EvanCarroll changed the title from "External_Author_Id" to "External_Author_Id and Author relationship" on Aug 28, 2018
@EvanCarroll changed the title from "External_Author_Id and Author relationship" to "external_author_id and author relationship" on Aug 28, 2018
@gsmith-to

I was trying to split this into an 'author' table and a 'tweets' table, and found that none of the fields below is consistent with alt_external_id. That is, for each field 'f' in the list, you can find a pair of records that have the same alt_external_id but different values of 'f':

external_author_id
author
account_type
account_category
new_june_2018

Likewise, there seem to be no fields that are consistent with external_author_id.
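
For anyone who wants to reproduce the check, something like the following sketch should work. It assumes PostgreSQL 9.4 or later (for FILTER) and the rustweets.tweets table from the first comment; the n_* and inconsistent_* aliases are made up for illustration.

```sql
-- For each alt_external_id, count the distinct values of every candidate
-- field, then report how many alt_external_id values carry more than one
-- value of that field. Note that count(DISTINCT ...) ignores NULLs.
SELECT count(*)                                         AS ids_total,
       count(*) FILTER (WHERE n_external_author_id > 1) AS inconsistent_external_author_id,
       count(*) FILTER (WHERE n_author > 1)             AS inconsistent_author,
       count(*) FILTER (WHERE n_account_type > 1)       AS inconsistent_account_type,
       count(*) FILTER (WHERE n_account_category > 1)   AS inconsistent_account_category,
       count(*) FILTER (WHERE n_new_june_2018 > 1)      AS inconsistent_new_june_2018
FROM (
    SELECT alt_external_id,
           count(DISTINCT external_author_id) AS n_external_author_id,
           count(DISTINCT author)             AS n_author,
           count(DISTINCT account_type)       AS n_account_type,
           count(DISTINCT account_category)   AS n_account_category,
           count(DISTINCT new_june_2018)      AS n_new_june_2018
    FROM rustweets.tweets
    GROUP BY alt_external_id
) AS per_id;
```

Any non-zero inconsistent_* count means that field is not determined by alt_external_id. Grouping by external_author_id instead (and counting alt_external_id in its place) repeats the same check for the other id.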

@EvanCarroll

So the conclusion is that the external_author_id is trash.

@EvanCarroll

BTW, new_june_2018 is not unique per author either:

SELECT author, count(DISTINCT new_june_2018)
FROM rustweets.tweets
GROUP BY author
HAVING count(DISTINCT new_june_2018) > 1;
  author   | count 
-----------+-------
 MONEYFORM |     2
(1 row)
