Wednesday, November 12, 2014

Python - spider and scrape data - test data set phase - oDesk

We need a python developer to script procedure(s) to navigate a web application 4 levels down and scrape data along the way. This is a data set that is publicly available but is not produced as a unified corpus. In addition it's presented in it's web format with a mixture of technologies including Oracle application servers, Javascript and AJAX calls.



This is the test phase. We'll need to collect a subset of the data for initial testing and modeling and once ready we'll move on to the collection of the full data set.

If the developer selected for the test phase successfully collects the data via python he or she will be invited to collaborate in the full collection phase of the project.



We will provide detailed documentation and instructions as well as be available for support.

Deliverables are: an accurate data set in CSV format and/or JSON (to be determined), full python scripts created to collect data set plus full and detailed description of process/sequence required to reproduce the results provided. And finally, a completed post-work questionnaire (related, exclusively, to the work performed).



Thanks for considering our project.



Posted On: November 12, 2014 18:51 UTC

ID: 204779650

Category: Web Development > Other - Web Development

Skills: Array, Array, Array, Array, Array

Country: United States

click to apply



from Online Job Search

No comments:

Post a Comment