Web scraping

Revision as of 20:09, 20 August 2008 by rosettacode>Paddy3118 (New page. Python example. given)

Create a program that downloads the time from this URL: http://tycho.usno.navy.mil/cgi-bin/timer.pl and then prints the current UTC time by extracting just the UTC time from the web page's HTML.

Task
Web scraping
You are encouraged to solve this task according to the task description, using any language you may know.

Only use libraries that come at no extra monetary cost with the programming language and that are widely available and popular, such as CPAN for Perl or Boost for C++.

Python

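<python>
import urllib

# Open the page; the response can be iterated line by line.
page = urllib.urlopen('http://tycho.usno.navy.mil/cgi-bin/timer.pl')
for line in page:
    # The UTC entry is the line that ends in ' UTC'.
    if ' UTC\n' in line:
        # Drop the first four characters (presumably a leading <BR> tag).
        print line.strip()[4:]
        break

page.close()
</python>

Sample Output: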

Aug. 20, 19:50:38 UTC
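
The example above is written for Python 2 (urllib.urlopen and the print statement). A rough Python 3 sketch of the same approach might look like the following. The names extract_utc and fetch_utc are hypothetical, the latin-1 decoding is an assumption (the page's encoding is not stated here), and the sample line is hand-made from the sample output with an assumed leading <BR> tag. Unlike the original, it tests whether the stripped line ends in ' UTC', which also tolerates other line endings.

```python
import urllib.request

# URL taken from the task description; it may no longer be reachable.
URL = 'http://tycho.usno.navy.mil/cgi-bin/timer.pl'


def extract_utc(lines):
    """Return the first line ending in ' UTC', minus its first four characters.

    The four characters dropped are assumed to be a leading '<BR>' tag,
    as in the Python 2 example. Returns None if no such line is found.
    """
    for line in lines:
        stripped = line.strip()
        if stripped.endswith(' UTC'):
            return stripped[4:]
    return None


def fetch_utc(url=URL):
    """Download the page and extract the UTC line (needs network access)."""
    with urllib.request.urlopen(url) as page:
        # urlopen yields bytes in Python 3, so each line is decoded first.
        # latin-1 is an assumption; it never fails on arbitrary bytes.
        return extract_utc(raw.decode('latin-1') for raw in page)


# Offline demonstration on a hand-made line (not real server output).
sample = ['<BR>Aug. 20, 19:50:38 UTC\n']
print(extract_utc(sample))  # Aug. 20, 19:50:38 UTC
```

Keeping the extraction separate from the download lets the parsing be checked without network access; print(fetch_utc()) would perform the actual request.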