Guides by LAL
Guides by LAL
Go home
Premium Soft
Premium themes / templates
Home
»
2014
»
August
»
30
» php|architect’s Guide to Web Scraping with PHP
20:13
php|architect’s Guide to Web Scraping with PHP
Chapter 1 — Introduction
Intended Audience
How to Read This Book
Web Scraping Defined
Applications of Web Scraping
Appropriate Use of Web Scraping
Legality of Web Scraping
Topics Covered
Chapter 2 — HTTP
Requests
Responses
Headers
Wrap-Up
Chapter 3 — HTTP Streams Wrapper
Simple Request and Response Handling
Stream Contexts and POST Requests
Error Handling
HTTP Authentication
A Few More Options
Wrap-Up
Chapter 4 — cURL Extension
Simple Request and Response Handling
Contrasting GET and POST
Setting Multiple Options
Handling Headers
Debugging
Cookies
HTTP Authentication
Redirection
Referers
Content Caching
User Agents
Byte Ranges
DNS Caching
Timeouts
Request Pooling
Wrap-Up
Chapter 5 — pecl_http PECL Extension
GET Requests
POST Requests
Handling Headers
Debugging
Timeouts
Content Encoding
Cookies
HTTP Authentication
Redirection and Referers
Content Caching
User Agents
Byte Ranges
Request Pooling
Wrap-Up
Chapter 6 — PEAR::HTTP_Client
Requests and Responses
Juggling Data
Wrangling Headers
Using the Client
Observing Requests
Wrap-Up
Chapter 7 — Zend_Http_Client
Basic Requests
Responses
URL Handling
Custom Headers
Configuration
Connectivity
Debugging
Cookies
Redirection
User Agents
HTTP Authentication
Wrap-Up
Chapter 8 — Rolling Your Own
Sending Requests
Parsing Responses
Transfer Encoding
Content Encoding
Timing
Chapter 9 — Tidy Extension
Validation
Tidy
Input
Configuration
Options
Debugging
Output
Wrap-Up
Chapter 10 — DOM Extension
Types of Parsers
Loading Documents
Tree Terminology
Elements and Attributes
Locating Nodes
XPath and DOMXPath
Absolute Addressing
Relative Addressing
Addressing Attributes
Unions
Conditions
Resources
Chapter 11 — SimpleXML Extension
Loading a Document
Accessing Elements
Accessing Attributes
Comparing Nodes
DOM Interoperability
XPath
Wrap-Up
Chapter 12 — XMLReader Extension
Loading a Document
Iteration
Nodes
Elements and Attributes
DOM Interoperation
Closing Documents
Wrap-Up
Chapter 13 — CSS Selector Libraries
Reason to Use Them
Basics
Hierarchical Selectors
Basic Filters
Content Filters
Attribute Filters
Child Filters
Form Filters
Libraries
Wrap-Up
Chapter 14 — PCRE Extension
Pattern Basics
Anchors
Alternation
Repetition and Quantifiers
Subpatterns
Matching
Escaping
Escape Sequences
Modifiers
Wrap-Up
Chapter 15 — Tips and Tricks
Batch Jobs
Availability
Parallel Processing
Crawlers
Forms
Web Services
Testing
That’s All Folks
Appendix A —
Legality of Web Scraping
Chapter B —
Multiprocessing
1
2
3
4
5
Views:
1464
|
Added by:
Marsipan
|
Rating:
0.0
/
0
Total comments:
0
Log in:
Send