Apr 14 2021 08:09 AM - edited Apr 14 2021 08:11 AM
Hi everyone! I've just started using PQ so i'm sorry if this is a dumb question.
I'm trying analyse COVID-19 data in my state, in Brazil. There are two national databases, one for 2020 and another 2021 - available here-
SRAG 2020 - Banco de Dados de Síndrome Respiratória Aguda Grave
SRAG 2021 - Banco de Dados de Síndrome Respiratória Aguda Grave
Using PQ I filtered data only about my state, and combined both databases. However, there is something going wrong. There are two problems
1) Excessive number of lines.
(2020 filtered base) 5096 lines + (2021 filtered base) 14684 = 29368 ???
Why are there more lines in the concatenated table then in both tables together?
2) The concatenated table is not showing part of the data, as I've checked manually with the filters
2021 table^ (1st pic) Concatenated table^(2nd pic)
I'm using Microsoft 365 for Business
What can I do to solve those 2 problems? Thank you very much!
Apr 14 2021 09:07 AM - edited Apr 14 2021 09:16 AM
Solution@pedrofim14 It seems like you are appending the 2020 data to itself. 14684 X 2 = 29368
Apr 14 2021 11:10 AM
I agree with @Riny_van_Eekelen , but it's better if you share the file to be sure. These days there tons of files working with COVID data, I don't think in this one is something unique what's worth to protect, especially initial version.
Apr 15 2021 12:53 PM
@Sergei Baklan @Riny_van_Eekelen you were precisely right! I've corrected it and both problems are now solved. Thank you very much for your attention! :D
I'll leave the database here anyway, as Sergei suggested: https://observatoriosocialmt-my.sharepoint.com/:f:/g/personal/pedro_fim_observatoriosocialmt_org/Epi...
Apr 14 2021 09:07 AM - edited Apr 14 2021 09:16 AM
Solution@pedrofim14 It seems like you are appending the 2020 data to itself. 14684 X 2 = 29368