Write a parser for this Wikipedia page:
[login to view URL]
that will write out the data in a spreadsheet with one entry-per-line. The 3 columns are
Romanization of the Country name
Country Name in original script
Language
The format of the data in the Wikipedia page is Romanization in Boldface (if different from the name) then followed by the name in the original language and then the languages in between parenthesis.
As an example, some of the entries on Afghanistan would become:
,Afeganistão,Portuguese
,Afghanistan,Welsh
,An Afganastáin,Irish
,Afganastan,Scots Gaelic
,Afgania,Latin
,Afganio or Afganujo,Esperanto
,Afganistan,Bosnian
,AfganistanCatalan
,Afganistan,Croatian
,Afganistan,Czech
Afgānistān,अफ़गानिस्तान,Hindi
Avghaneti,ავღანეთი,Georgian
This is to be done by a program that would download the source and process the date in the particular format above. This is NOT to be done by hand.A CSV file should be written down with the values at the end.
Paulo Ney
Hi, I can provide a fast & accurate solution for your needs using C#. Shouldn't take more than a few hours, considering that I've done similar projects before. Regards, Cosmin.