Flutterby™!
Extracting data from HTML
2017-06-05 18:46:06.44003+02 by
Dan Lyke
Quick Tip: The easiest way to grab data out of a web page in Python is pandas' read_html function.
It’s that simple! Pandas will find any significant HTML tables on the page
and return each one as a new DataFrame object.
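To make the "each table becomes a DataFrame" part concrete, here is a minimal sketch that runs read_html against an inline HTML string rather than a live page. The table contents are made up for illustration, and read_html needs an HTML parser installed alongside pandas (lxml, or beautifulsoup4 with html5lib).

```python
from io import StringIO

import pandas as pd

# Hypothetical page content, standing in for a real URL.
html = """
<html><body>
<table>
  <tr><th>Call Date</th><th>Call Type</th><th>Units</th></tr>
  <tr><td>2017-06-05 09:15:00</td><td>Medical</td><td>2</td></tr>
  <tr><td>2017-06-05 10:40:00</td><td>Traffic Accident</td><td>3</td></tr>
</table>
<table>
  <tr><th>Station</th><th>Crew</th></tr>
  <tr><td>1</td><td>4</td></tr>
</table>
</body></html>
"""

# read_html returns a list with one DataFrame per table it finds.
tables = pd.read_html(StringIO(html), header=0)

for i, df in enumerate(tables):
    print(i, df.shape, list(df.columns))
```

Passing a URL instead of the StringIO object works the same way; the list just comes back with however many tables that page happens to contain.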
The demo ends up as:
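import pandas as pd
# read_html returns a list of DataFrames; "calls_df, =" unpacks it and fails unless there is exactly one table.
calls_df, = pd.read_html("http://apps.sandiego.gov/sdfiredispatch/", header=0, parse_dates=["Call Date"])
# Write the table to calls.csv without the index column.
calls_df.to_csv("calls.csv", index=False)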
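The demo relies on the page holding exactly one table. If that assumption might not hold, one option is to check the count explicitly, roughly as sketched below. The helper name read_single_table is hypothetical (it is not part of pandas), and the inline HTML is a made-up stand-in for the dispatch page, whose real columns may differ.

```python
from io import StringIO

import pandas as pd


def read_single_table(source, **kwargs):
    """Return the only table in `source`, or raise if there are several.

    Hypothetical helper, not a pandas function. pd.read_html itself
    raises ValueError when it finds no tables at all.
    """
    tables = pd.read_html(source, **kwargs)
    if len(tables) != 1:
        raise ValueError(f"expected exactly 1 table, found {len(tables)}")
    return tables[0]


# Made-up stand-in for the real page.
sample = StringIO("""
<table>
  <tr><th>Call Date</th><th>Call Type</th></tr>
  <tr><td>2017-06-05 09:15:00</td><td>Medical</td></tr>
</table>
""")

calls_df = read_single_table(sample, header=0, parse_dates=["Call Date"])

# With no path argument, to_csv returns the CSV text instead of writing a file.
csv_text = calls_df.to_csv(index=False)
print(csv_text)
```

The behavior matches the one-line unpacking in the demo when the page is well behaved; the difference is only that the error message says how many tables were actually found.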
Flutterby™ is a trademark claimed by
Dan Lyke for the web publications at www.flutterby.com and www.flutterby.net.