From 4af1334cfb8c1c801c4746bf097366b034d0e6da Mon Sep 17 00:00:00 2001 From: Dimitris Zlatanidis Date: Wed, 29 Jun 2016 21:57:00 +0300 Subject: python/w3lib: Added (Library of web-related functions). Signed-off-by: Dimitris Zlatanidis --- python/w3lib/README | 10 ++++++++++ 1 file changed, 10 insertions(+) create mode 100644 python/w3lib/README (limited to 'python/w3lib/README') diff --git a/python/w3lib/README b/python/w3lib/README new file mode 100644 index 0000000000..b2e25c80c4 --- /dev/null +++ b/python/w3lib/README @@ -0,0 +1,10 @@ +This is a Python library of web-related functions, such as: + +remove comments, or tags from HTML snippets +extract base url from HTML snippets +translate entites on HTML strings +convert raw HTTP headers to dicts and vice-versa +construct HTTP auth header +converting HTML pages to unicode +sanitize urls (like browsers do) +extract arguments from urls -- cgit v1.2.3