Wrap-Up

This concludes the chapter. At this point, you should have your obtained document in a format suitable for input to an XML extension. The following few chapters will be devoted to using specific extensions to searching and extracting data from repaired documents.

For the PHP manual section on the tidy extension, see http://php.net/tidy.

For documentation on the tidy library itself, see http://tidy.sourceforge.net/#docs.

For a tidy configuration setting reference, see http://tidy.sourceforge.net/docs/quickref.html.


© Tidy Extension — Web Scraping

>>> Back to TABLE OF CONTENTS <<<
Category: Article | Added by: Marsipan (01.09.2014)
Views: 312 | Rating: 0.0/0
Total comments: 0
avatar