- Mar 1, 2000
- 30,890
- 5,001
- 126
We often hire 3rd party contractors to build us simple applications that allow us to do some data-mining en masse off of various public websites.
In most scenarios, we feed the program (or the 3rd party directly) a csv of the data fields we want entered into the forms on the website(s), and it basically automates the task of entering the information into the forms and capturing the results (either in a screen shot, or a return csv of the data etc etc)
I was curious as to how they go about doing this. Are they just parsing the HTML of the document itself and inserting where applicable or ???
** please note that these sites are set up for these sort of uses and have no problems with such practices.
In most scenarios, we feed the program (or the 3rd party directly) a csv of the data fields we want entered into the forms on the website(s), and it basically automates the task of entering the information into the forms and capturing the results (either in a screen shot, or a return csv of the data etc etc)
I was curious as to how they go about doing this. Are they just parsing the HTML of the document itself and inserting where applicable or ???
** please note that these sites are set up for these sort of uses and have no problems with such practices.