VB.net spider website get aircraft airline information
$100-130 USD
Në vazhdim
Postuar over 12 years ago
$100-130 USD
Paguhet në dorëzim
This project is in VB.net.
Your task is to write for us some vb6 code that can be used to get aircraft data from the [login to view URL] website and store it in
a database. We will probably change your code to store in an MSSQL database eventually, but you can just have it store in an Access database for now as that's easier.
SEE PROJECT DETAILS FOR COMPLETE DESCRIPTION
## Deliverables
Your task is to write for us some vb6 code that can be used to get aircraft data from the [login to view URL] website and store it in
a database. We will probably change your code to store in an MSSQL database eventually, but you can just have it store in an Access database for now as that's easier.
We also expect you to supply one complete access database from a run of this to prove to us that it ran successfully.
Each field int the database will have a "DateModified" field so that we know when that record chagned. So, if we run your program every two weeks, we can see when a given row was either added or modified.
Of coruse, your code will parse their site as it is now. If their site format changes, your code is expected to be relatively clean so that we can update your code in the future.
Specifically, what we are looking to spider is this:
Start at
[login to view URL]
This has a list of airlines. So, this is the first table that you need.
AIRLINE
AirlineID (index field)
AirlineName text
ICAO text
IATA text
CallSign text
Country text
WebsiteURL text
AirlineName text
AirlineFullName text
AirlineFounded integer
AirlineCeasedOps integer
isDeleted yes/no <- see below
DateModified date/time <- date modified or record created. this should not change if on another run of your code this record has not been modified
Note how the last three fields above are gotten by loading the airline page and it is near the top.
Then, for each airline, the goal is to get information about each aircraft.
AIRCRAFT
-- i think this is all the fields. if you run into more, just add them -- some fields only appear in some cases.
AircraftID (index field)
AirlineID (refers to airline table, long integer)
isHistoric YES/NO = Yes if it appears in the "historic fleet" section. "no" if it is in the "Current Fleet"
CN text
Registratin text - also known as "Registration" - this is the most important field in the app.
LN text
AircraftType text - if you want to save space, you can make another table for this and call this AircraftTypeId, since there will be much duplication
Config text
Delivered text - I think it is better to have this as text since it is not always a clear date.
PrevReg text - maybe a memo is needed, but if all these are < 255 characters, then text is fine
Became text - maybe a memo is needed, but if all these are < 255 characters, then text is fine
Remark memo - i think a memo is needed, but if they are all < 255 characters, then text is fine
Status text
ExitDate text
AircraftName text
DateModified date/time <- date modified or record created. this should not change if on another run of your code this record
isDeleted yes/no <- no in most cases (see below)
PHOTOS
PhotoID (index field, autonumber)
AircraftID (refers to [login to view URL], long integer)
ThumbnailURL text complete URL to thumbnail of image
FullImageURL text complete URL to full-size image
Copyrighht text Appears under the thumbnail as a text
isDeleted yes/no <- see below
DateModified date/time <- date modified or record created. this should not change if on another run of your code this record
what you must deliver:
1. vb.net code that goes out and gets the above data into an access database. our goal is not to put excessive strain on their server, so if your code does this "politely" such as with reasonable interval after some set of requests, this is nice.
2. one complete run of the database - that is, you must deliver an access database that is full with the data as above.
your code:
1. must be clean, decent, vb.net code.
2. must be ready or easily adaptable to run on a server with no UI. however, if you provide a small hook desktop program (basically, one button that says "get site" and maybe with a status indicator) that can be removed easily by decent coders, then this is fine too.
3. must each time it runs update the "date modified" fields only if that field has been modified since the last run.
a question is what should you do if you run into, for example, the case where if one airline has an aircraft with the registration N12345 in a list and then the next time N12345 is completely gone. in this case, you don't delete N12345's record, but rather a) you set the ismodified field to now and b) you set the 'isdeleted' field to 'yes' (this field can set back to 'no' in a future run should it reappear). same for the 'isdeleted' in all fields.
I think this ia very clear description. please let me know if you can do it. in the worst case, we dont need the 'photos' inforamtion, but figure since we are there, we might as well try.
thank you for your bids.