Flutterby™! : Extracting data from HTML

Next unread comment / Catchup all unread comments User Account Info | Logout | XML/Pilot/etc versions | Long version (with comments) | Weblog archives | Site Map | | Browse Topics

Extracting data from HTML

2017-06-05 18:46:06.44003+02 by Dan Lyke 0 comments

Quick Tip: The easiest way to grab data out of a web page in Python:

It’s that simple! Pandas will find any significant html tables on the page and return each one as a new DataFrame object.

The demo ends up as:

import pandas as pd
calls_df, = pd.read_html("http://apps.sandiego.gov/sdfiredispatch/", header=0, parse_dates=["Call Date"])
calls_df.to_csv("calls.csv", index=False)

[ related topics: Invention and Design Monty Python Python Furniture ]

comments in ascending chronological order (reverse):