Save web pages as HTML files / scrape webpage

$10-30 USD

Completed
Posted over 7 years ago

Paid on delivery
This task is about systematically downloading web pages and saving them as HTML. No fancy scraping skills are required.

In a nutshell:
A. I will give you a list of about 21K names. These will be used to search for webpages and save them as HTML.
B. The search results and the names of the downloaded files will be recorded in “[login to view URL]”.
C. You will need to provide me with the code that did the work for you after everything is finished.

If you are interested, please read the instructions and submit:
I. A bid.
II. An estimate of how long this will take you.
III. A very brief explanation of how you will execute this task.
IV. IMPORTANT: An example of the deliverables (detailed below) using a few names. Please do this manually; no significant time investment or coding is required. I just want to make sure that everything has been clearly understood.

These are the instructions in detail:
1. The comma-delimited text file “[login to view URL]” is a list of 21160 names with 4 columns: ROWID, NOMBRES, APELLIDO_PATERNO, and APELLIDO_MATERNO. You will use the last three to download the webpages.
2. For each row in [login to view URL], go to [login to view URL] and enter the NOMBRES, APELLIDO_PATERNO, and APELLIDO_MATERNO in the search engine. Then click on “buscar”.
3. Click on the person that EXACTLY matches the information entered in the step above. (Note: if more than one result matches the exact information entered, or if no result matches the exact information, you will need to click on every person that resulted from the search and execute the steps below for each case.)
4. After clicking “search”, make sure that you are in the HISTORIAL PARTIDARIO tab. You can recognize this because the URL finishes in “IdTab=0”. Save that web page. The name of this file should be “IdTab0_IdPolitico########”, where ######## is the politician’s id number. You can find it in the URL. There should be a one-to-one relationship between individuals and IdPolitico. I’ll be using the HTML code to scrape some information. You only need to save the web page.
5. Now click on the PROCESOS ELECTORALES tab. You can recognize the PROCESOS ELECTORALES subtab because the URL finishes in “IdTab=1”. Save that webpage. The name of this file should be “IdTab1_IdPolitico########.html”, where ######## is the politician’s id number.
6. Then click on every “HOJA DE VIDA” available in the subtab PROCESOS ELECTORALES. We are only interested in those of the 2010 or 2006 elections. Ignore the HOJA DE VIDA of the 2014 election. Note that the links to the HOJAS DE VIDA are embedded in the PROCESOS ELECTORALES sub-table. The link in the uppermost part of the webpage saying “ver hoja de vida” is not the one we want. Click on each “HOJA DE VIDA” and save it. The name of this file should be “CVYYYY_IdPolitico########”, where ######## is the politician’s id number and YYYY is the election year. You can figure out the election year from the HOJA DE VIDA URL as well. If there’s more than one HOJA DE VIDA per politician, save them all unless it is a HOJA DE VIDA from the 2014 election.
7. Record all your steps in “[login to view URL]”. The idea is to save all the URLs from which information was downloaded and the corresponding file names. See the attached example for details.
8. I am attaching an example ([login to view URL]), the name list, and a template for the logfile.

The deliverables for this project are:
a) All downloaded files.
b) A complete logfile (XLSX).
c) The code you used to download the information.

Some other notes:
- Make sure to read the names in Unicode UTF-8. There are characters, such as Ñ, that may be a problem if you don’t.
- Make sure the downloaded HTML pages and the logfile are compatible with Unicode UTF-8.
- Everything will be done through Freelancer.com. This includes communication, payments, and others.

Thanks. Please, if you have any questions, just send me a message.
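As a rough illustration of the naming rules in steps 1 and 4 to 6, the Python sketch below reads the name columns as UTF-8 text and derives file names from a profile URL. Several things here are assumptions, not facts from the brief: the real site URL is not shown, so `example.org` and the `perfil` path are placeholders; the politician id is assumed to appear as an `IdPolitico` query parameter next to `IdTab`; every file is given an `.html` extension (the brief shows it explicitly only for the IdTab1 file); and the election year is passed in directly, since how it is encoded in the HOJA DE VIDA URL is not specified.

```python
import csv
import io
from urllib.parse import urlparse, parse_qs

# Only the 2006 and 2010 elections are wanted; 2014 is excluded by the brief.
WANTED_CV_YEARS = {2006, 2010}


def read_names(text_stream):
    """Yield (NOMBRES, APELLIDO_PATERNO, APELLIDO_MATERNO) for each row."""
    for row in csv.DictReader(text_stream):
        yield row["NOMBRES"], row["APELLIDO_PATERNO"], row["APELLIDO_MATERNO"]


def tab_filename(url):
    """Build 'IdTab<n>_IdPolitico<id>.html' from the URL's query string.

    Assumes (hypothetically) that both IdTab and IdPolitico are query parameters.
    """
    query = parse_qs(urlparse(url).query)
    return f"IdTab{query['IdTab'][0]}_IdPolitico{query['IdPolitico'][0]}.html"


def cv_filename(politico_id, year):
    """Return 'CV<year>_IdPolitico<id>.html', or None for an unwanted year."""
    if int(year) not in WANTED_CV_YEARS:
        return None
    return f"CV{int(year)}_IdPolitico{politico_id}.html"


# In practice the list would be opened with
# open(path, encoding="utf-8", newline=""); a small in-memory sample is used here.
sample = io.StringIO(
    "ROWID,NOMBRES,APELLIDO_PATERNO,APELLIDO_MATERNO\n"
    "1,JOSÉ,NUÑEZ,PEÑA\n"
)
names = list(read_names(sample))
print(names)

# Placeholder URL; the real host and path are not given in the brief.
print(tab_filename("https://example.org/perfil?IdPolitico=12345678&IdTab=1"))
print(cv_filename("12345678", 2014))
```

Reading the file with an explicit `encoding="utf-8"` is what keeps characters such as Ñ intact, which is the concern raised in the notes above.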
Project ID: 11520104

About the project

14 proposals
Remote project
Active 8 yrs ago

Awarded to:
Hi there, I am the best candidate to complete this job. I have previously worked on several large-scale scraping projects. In my last project, I created an automated crawler on Amazon Web Services which has so far crawled more than 5 million records from many websites and is still running. Here is sample code from that project, which I used to crawl Macy's website -> [login to view URL] Because of my experience with scraping, I can assure you I can complete this task within 3 days, and I am sure we can continue this relationship with further projects. I would code the project in Python using the requests and Beautiful Soup libraries and would deliver all the deliverables you mentioned as well as the code for the project. If you are interested or want to know more about me, please don't hesitate to contact me. You can check some of my work at [login to view URL] I would send HTTP requests using the requests module, then simply save the content I get from each request into an HTML file and record the steps in an Excel file. This is my first time on this site, so I do not have any ratings or testimonials to show you. I can assure you that if you work with me once, you will always work with me for this kind of project. Give me a chance and you won't regret it. Regards, Anchit
$10 USD in 2 days
5.0 (1 review)
1.4
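The save-and-record approach outlined in the bid above could look roughly like the Python sketch below. It is not the bidder's code. The fetch step is left out so the example runs offline (with the requests library it would be something like `requests.get(url).text`), the URL and the helper names `save_page` and `append_log` are hypothetical, and a CSV file stands in for the XLSX logfile; producing real XLSX would need an extra library such as openpyxl.

```python
import csv
import tempfile
from pathlib import Path


def save_page(html_text, filename, out_dir):
    """Write the page source to out_dir/filename as UTF-8 and return the path."""
    path = Path(out_dir) / filename
    path.write_text(html_text, encoding="utf-8")
    return path


def append_log(log_path, url, filename):
    """Append one (url, filename) row to a CSV log, writing a header if new."""
    log_path = Path(log_path)
    is_new = not log_path.exists()
    with log_path.open("a", encoding="utf-8", newline="") as handle:
        writer = csv.writer(handle)
        if is_new:
            writer.writerow(["url", "filename"])
        writer.writerow([url, filename])


out_dir = tempfile.mkdtemp()
# Placeholder URL and page body; the real site is not named in the listing.
url = "https://example.org/perfil?IdPolitico=12345678&IdTab=0"
html = "<html><body>NUÑEZ PEÑA</body></html>"

saved = save_page(html, "IdTab0_IdPolitico12345678.html", out_dir)
log_file = Path(out_dir) / "log.csv"
append_log(log_file, url, saved.name)
print(saved.name)
```

Writing both the page and the log with an explicit UTF-8 encoding addresses the client's note about characters such as Ñ.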
14 freelancers are bidding on average $185 USD for this job
Hi sir, I am a scraping expert and have done many similar projects; please check my feedback and you will see. Can you tell me more details? Then I will provide demo data for you. Thanks, Kimi
$250 USD in 5 days
5.0 (325 reviews)
7.4
Looking forward to discussing the project details further and delivering to your specifications.
$166 USD in 1 day
4.9 (169 reviews)
7.6
I'm a web scraping expert with HUNDREDS of completed projects (see my reviews), so I'm sure you'll be impressed with my work. I can create the program you want in less than 3 days, and I can offer you a reasonable price. Your project description is pretty good, so all I need to start is a milestone payment from you. Thanks. Roman
$210 USD in 3 days
4.9 (345 reviews)
7.0
Hi there - My name is Jhalak. I’ve read your brief and can see that you’d like to build a website. My team has years of experience designing and developing mobile apps and websites, as well as SEO. I would approach your project by starting with wireframes and getting the site design completed before starting the actual development phase. I am highly qualified for this project and would love to speak with you further about taking it on. If you'd like to view my previous work, take a look at my Freelancer Portfolio. Regards, Jhalak. Thanks, Diamond. Looking forward to your reply.
$1,388 USD in 1 day
4.8 (186 reviews)
6.8
Yes sir, let's go through the requirements to clarify the details. If you want us to share our skills and previous work, let us know. Hope to hear from you soon. RIMSHA
$142 USD in 6 days
4.7 (159 reviews)
7.0
Hi there, I have read the project description. I can write a program/script to get these HTML files. I have good web scraping reviews for my past projects. Hope to hear from you.
$50 USD in 1 day
5.0 (136 reviews)
6.2
Hello, my name is Pranav. I have checked the details you shared and will do exactly what you want. Please consider my bid; we can discuss further so that I can assist you better. Our services remain available to you even after we complete the project. My portfolio: https://www.freelancer.com/u/amitarai.html
$100 USD in 7 days
4.8 (105 reviews)
6.1
I am a skilled freelancer with more than 4 years of working experience in web research, lead generation, data entry, data mining, lead collection, and all types of scraping tools, such as Data Toolbar, WebHarvy, VWR, etc. My experience makes me a strong fit. If you hire me, I will deliver work of a high standard and make an exceptional contribution to your task.
$10 USD in 1 day
5.0 (30 reviews)
5.4
Hello. I have read the information you supplied and am interested in the project. I understand the essence of the task perfectly. I have a lot of web development experience, including similar projects. I would be happy to take part in your project and bring it to completion, and I am keen to work. I propose that we discuss the details of your project and budget more closely, and I would like to see the detailed technical specification. I look forward to our cooperation. I would carry out your project by simply copying the shtml code, scripts and other files. I'll wait for your answer in the chat.
$35 USD in 2 days
4.9 (21 reviews)
4.6
5 reasons why you should hire me for your custom website and application development:
1. Available 24/7 upon your request
2. On-time delivery with 100% satisfaction
3. Always think beyond boundaries and provide user-friendly solutions
4. Provide excellence with commitment
5. Most important, free technical support for a lifetime

If you are looking for custom website development such as an intranet application, web directory, online community, online portal, or a site connecting consumers and vendors, your search will end here.

About me: Over the last 3 years, I have developed a wide range of websites and apps using .Net, PHP, web-based applications, desktop-based software, MS SQL and MySQL, including sites for start-up companies and small businesses. My core competency lies in complete end-to-end management of a new website development project, and I am seeking opportunities to build websites from the ground up for you or your business.

While working on your project, I take care of:
- Clean and neat design
- Daily communication to show progress
- Unit testing with 100% results
- Free lifetime support for any technical issue!

Thank you
$20 USD in 6 days
4.3 (23 reviews)
4.9
Extensive experience in web and backend development. I fully understand REST and scraping methodology: mouse reading and automation, web decryption, URL injection, and low-level code on Windows.
$166 USD in 5 days
0.0 (0 reviews)
0.0

About the client

Durham, United States
5.0
3
Payment method verified
Member since Aug 6, 2016
