Hi there.
I need someone to write a program that can auto catch data from this site : [login to view URL]
The program is what I [login to view URL] the data.
Look at the category list in the right site first.
I need the program can auto catch some categories' data.
From [キングダム] to [横浜線ドッペルゲンガー].
*Check the attachment named “category”
You have to output data in the following way:
Data Structure:
id----order
url----URL of the article
cat----category
title----title
magazine----magazine
author----author
genre----genre
character----character
site----goal website
article----the text
entry_data_at----publish time
created_at----catch time
picture----the cover the article
*Check check the attachment named “tip1,tip2”
Explanation:
[cat],means the name of [login to view URL] the category list in the right [login to view URL] can see words like [キングダム] and [トキワ来たれり!!],they are categorise.
[title],check the category [キングダム],turns to a new page,you can see words such as [キングダム 最新 492話 ネタバレ&感想 入隊選抜試験と逸材発見!?] or [キングダム 最新 491話 ネタバレ&感想 秦趙決裂と軍備強化], they are titles.
[article],check one title like [キングダム 最新 492話 ネタバレ&感想 入隊選抜試験と逸材発見!?],turns to a new page,you can see an article with lot of [login to view URL] have to catch the body which from the title(キングダム 最新 492話 ネタバレ&感想 入隊選抜試験と逸材発見!?) to the end of the article (end at the place above [第491話へ][第493話へ] and advertisements).
[entry_data_at],means the publish time of the articel,for example,the publish time of キングダム 最新 492話 ネタバレ&感想 入隊選抜試験と逸材発見!? is the one written under the title - 2016/10/[login to view URL] have to record it by using timestamp,which would turn 2016/10/01 into 1451577600.
[url],means the url of the article,like [login to view URL]
[site],all write as [login to view URL]
[character],for example,
[login to view URL]
You can see words written in blue [第492話 成長への募兵].
In the Developer Tools which is
<span style="font-size: x-large; color: #0000ff;">
<strong>第492話 成長への募兵</strong>
</span>
The number 492 is the [character].
About [author],[magazine],[genre],[picutre],[id] and [created_at],you should do the following step first.
Search any [cat] in [login to view URL],use the first result.
For example,search [キングダム] in [login to view URL],you can get:
作家:原泰久
雑誌・レーベル:ヤングジャンプ
ジャンル: バトル・アクション / 歴史 / 青年マンガ / アニメ化 / 中国史・三国志
So,
[author],means the words after [作家:]. In the example the [author] is [原泰久].
[magazine],means the words after [雑誌・レーベル:], In the example the [magazine] is [ヤングジャンプ].
[genre],means the words after [genre:],need to use "," to separate them. In the example the [genre] is [バトル・アクション,歴史,青年マンガ,アニメ化,中国史・三国志].
[pitucre],the cover of the first [login to view URL] have to catch covers and store [login to view URL] the datebase there should add a data bar of [pictuer] and have url of each cover.
[id],means the order, the first one is 1, the second one is 2, etc.(MySQL autoincrement field)
[created_at],means the time you catch the article,also have to record by using timestamp. For example,if I catch the date on UTC/GMT+08:00 2016/10/11 14:40:30, so the [created_at] should be 1476168030.
Use [キングダム] as the example, do what I said,you can get:
id:1
url:[login to view URL]
cat:キングダム
title:キングダム 最新 492話 ネタバレ&感想 入隊選抜試験と逸材発見!?
magazine:BE・LOVE
author: ヤングジャンプ
genre: バトル・アクション,歴史,青年マンガ,アニメ化,中国史・三国志
character:492
site:[login to view URL]
article:<h1 class="entry-title">......
entry_data_at:1451577600
created_at:1476168030
*Check the explanation named “database sample”.
This is what I [login to view URL] have to make the program to catch data in this way to make my server can recognize the data.
Need to catch data 2 hours one time.
Need to send me the program you write to catch data.
Need he full data scraper , also need the program that can catch new data and not catch old data again.
Tap 113114 in your bid.
Hi I am 4 yrs experienced Web developer (php) and having experience on scrapping nearly 2 yrs. I can scrap the data from any site based on requirement analyzed.
Hi,
I am interested in your project and would like to offer you my services for 300$. I have built scrapping applications before, few of them are
https://www.freelancer.com/projects/php/Grab-public-Facebook-data/
https://www.freelancer.com/projects/Scrap-classified-website/
https://www.freelancer.com/projects/Java-C-Sharp-Programming/Scraper/
https://www.freelancer.com/projects/C-Sharp-Programming-NET/Scrapping-websites/
Let's work together.
Best Regards,
Muhammad Shafeeq
Hello Sir,
I have rich experience in PHP/MySQL.
I have gone through the details specification of the project, I can do this project as I have rich experience in PHP/MySQL expertise since 7+ years and I have done many similar projects so I can do this project within estimated time-frame and cost price with best quality. I am ready to start this right now. I am full time freelance developer.
90+ projects are completed successfully on freelancer.com
My Past projects: https://www.freelancer.com/u/harish1984.html ( done on freelancer.com )
Thanks,
Harish