Forum Discussion
Hemanth_R (Copper Contributor)
Sep 15, 2022
Extraction of data from Website
Hi all,
I am trying to extract data from a website ( https://csr.gov.in/content/csr/global/master/home/home/csr-expenditure--geographical-distribution/state/district/companies.html?=Bangalore%20Urban=Karnataka=FY%202020-21 ), but the table is not showing in the 'Navigator' pane.
Please help with this.
- Patrick2788 (Silver Contributor)
The URL doesn't appear to be 'scrape friendly'. The address does not change if you increase the number of results per page, and the number of results per page is low, so you'd have to comb through the pages manually. Also, there's not much you can get from View Page Source (everything seems to be handled on the back end). It might be worth contacting the site to request a report.
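The View Page Source observation can be checked mechanically: a table that is filled in by script after the page loads typically leaves no `<table>` rows in the raw HTML, which would be consistent with nothing useful being detected. The following is a minimal sketch using only the Python standard library. The `count_tables` helper and both sample HTML strings are hypothetical illustrations, not taken from the site.

```python
from html.parser import HTMLParser


class TableCounter(HTMLParser):
    """Counts <table> and <tr> start tags in an HTML document."""

    def __init__(self):
        super().__init__()
        self.tables = 0
        self.rows = 0

    def handle_starttag(self, tag, attrs):
        # HTMLParser passes tag names in lowercase.
        if tag == "table":
            self.tables += 1
        elif tag == "tr":
            self.rows += 1


def count_tables(html_text):
    """Return (number of <table> tags, number of <tr> tags) in html_text."""
    counter = TableCounter()
    counter.feed(html_text)
    counter.close()
    return counter.tables, counter.rows


# Hypothetical samples: one page with a static table, one filled in by script.
static_page = "<html><body><table><tr><td>1</td></tr></table></body></html>"
scripted_page = "<html><body><div id='results'></div><script>load()</script></body></html>"

print(count_tables(static_page))    # (1, 1)
print(count_tables(scripted_page))  # (0, 0)
```

To try it on the real page, save the output of View Page Source to a file and pass its contents to `count_tables`. A result of zero rows would suggest the data arrives separately from the initial HTML.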
- mtarler (Silver Contributor)
That website seems to take a very long time to load the data. What exactly are you trying to do?
- Hemanth_R (Copper Contributor)
Trying to download the list of companies and their expenditure:
S.No | Company Name(s) | Amount (INR Cr.)
1 | Sbi Cards And Payment Services Limited | 8.87
2 | Biocon Limited | 6.46
3 | Biocon Biologics Limited | 3.1
- mtarler (Silver Contributor)
Like I mentioned, there is a long delay (~15 sec) before the page shows the data, and if you go to the 'Explore CSR Data' tab, each of those reports requests a captcha. I suspect they are doing this intentionally to prevent automated data mining, so you will need to download individual reports or contact them to see if there is another option.
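If the rows end up being copied by hand from each page, the pasted text can at least be tidied into a CSV automatically. The sketch below assumes the pasted text puts the serial number and company name on one line and the amount on the next. That layout is an assumption about how the copy comes out, and the helper names `parse_pasted_rows` and `to_csv` are hypothetical. The sample values are the three rows quoted in this thread.

```python
import csv
import io
import re

ROW_START = re.compile(r"^(\d+)\s+(.+)$")   # e.g. "2 Biocon Limited"
AMOUNT = re.compile(r"^\d+(?:\.\d+)?$")     # e.g. "6.46"


def parse_pasted_rows(text):
    """Pair each 'serial + name' line with the amount line that follows it."""
    rows = []
    pending = None  # (serial, name) waiting for its amount
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if pending is not None and AMOUNT.match(line):
            rows.append((pending[0], pending[1], float(line)))
            pending = None
            continue
        match = ROW_START.match(line)
        if match:
            pending = (int(match.group(1)), match.group(2))
        # Any other line (such as the header) is skipped.
    return rows


def to_csv(rows):
    """Render the parsed rows as CSV text with the original column headers."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["S.No", "Company Name(s)", "Amount (INR Cr.)"])
    writer.writerows(rows)
    return buffer.getvalue()


pasted = """S.No Company Name(s) Amount (INR Cr.)
1 Sbi Cards And Payment Services Limited
8.87
2 Biocon Limited
6.46
3 Biocon Biologics Limited
3.1"""

rows = parse_pasted_rows(pasted)
print(to_csv(rows))
```

The resulting CSV text can be saved to a file and opened directly in Excel. This only tidies data that was copied manually and does not bypass the page delay or the captcha.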