- Web Scraping
- HTML table structuring tags: table, ol, and ul
- Processing HTML as a tree
- Application Programming Interfaces (APIs)
- Determining HTTP
- Parameters within HTTP
- Data Acquisition and Processing
Day | Reading | Reading Questions |
---|---|---|
Monday | Chapter 22 | None |
Tuesday | - | |
Wednesday | Chapter 23, 23.1 and 23.2 | None |
Friday | Chapter 23, 23.3 | None |
HW | Day Out | Day Due | Contents |
---|---|---|---|
HW_4.6 | Tuesday | Thursday | Web Scraping Practicum |
HW_4.7 | Friday | Monday | APIs |
Web Scraping Practicum and Question Set. This is also the basis for HW_4.6.
There are few new data systems concepts to introduce this week. We start the week with web scraping, which is primarily an application of the tree-structured format from earlier in the semester: we acquire web pages through HTTP and then use the tree structure to extract the two-dimensional data they contain. Similarly, learning about APIs is more about understanding the provider's requirements for requests and translating those requirements into the proper HTTP requests.
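As a rough illustration of both ideas, the sketch below treats a small HTML fragment as a tree, pulls the rows of its table into a list of lists, and then encodes a set of request parameters into a URL query string. It uses only the Python standard library and makes several assumptions that are not from the course materials: the `PAGE` fragment is made up and is well-formed enough for an XML parser (real pages usually need a forgiving HTML parser), the names `table_rows` and `build_request_url` are hypothetical, and `https://api.example.com/search` is a placeholder rather than a real provider endpoint.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Hypothetical, well-formed HTML fragment standing in for a downloaded page.
PAGE = """
<html>
  <body>
    <table>
      <tr><th>Day</th><th>Reading</th></tr>
      <tr><td>Monday</td><td>Chapter 22</td></tr>
      <tr><td>Wednesday</td><td>Chapter 23</td></tr>
    </table>
  </body>
</html>
"""


def table_rows(html_text):
    """Return the first table in html_text as a list of rows (lists of cell strings)."""
    root = ET.fromstring(html_text)
    table = root.find(".//table")  # first <table> anywhere below the root
    if table is None:
        return []
    rows = []
    for tr in table.iter("tr"):
        # Each row is a subtree whose children are header (th) or data (td) cells.
        cells = [(cell.text or "").strip() for cell in tr if cell.tag in ("th", "td")]
        rows.append(cells)
    return rows


def build_request_url(base_url, params):
    """Append params to base_url as a percent-encoded query string."""
    return base_url + "?" + urlencode(params)


rows = table_rows(PAGE)
# Placeholder endpoint and parameter names; a real API documents its own.
url = build_request_url("https://api.example.com/search", {"q": "web scraping", "page": 2})
print(rows)
print(url)
```

The first function mirrors the scraping workflow (find the table node, walk its row subtrees, collect cell text into two-dimensional data), while the second shows how a provider's named parameters end up in the HTTP request itself.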