File: //usr/local/ssl/share/doc/python-urlgrabber-3.1.0/ChangeLog
2006-09-22 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/__init__.py:
release 3.1.0
2006-09-22 Michael D. Stenner <mstenner@linux.duke.edu>
* test/grabberperf.py, urlgrabber/grabber.py,
urlgrabber/keepalive.py, urlgrabber/sslfactory.py:
Added James Bowes' SSL patch to use M2Crypto when available.
2006-09-21 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
updated ChangeLog
2006-09-21 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/__init__.py:
release 3.0.0
2006-07-20 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
updated ChangeLog
2006-07-20 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/__init__.py:
release 2.9.10
2006-07-20 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/: byterange.py, grabber.py, keepalive.py:
Added patch from James Bowes to make keepalive, byteranges, etc.
work with https (oops!)
2006-04-04 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/keepalive.py:
Fixed a minor error reporting bug due to changes in python 2.4.
2006-03-22 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/grabber.py:
Patch from Jeremy Katz to catch read errors after the file has been
opened.
2006-03-02 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
updated ChangeLog
2006-03-02 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/__init__.py:
release 2.9.9
2006-03-02 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_grabber.py:
Added tests to make sure that the "quote" option works as
advertised.
2006-03-02 Michael D. Stenner <mstenner@linux.duke.edu>
* scripts/urlgrabber, test/test_grabber.py, urlgrabber/grabber.py:
Significant improvement to URL parsing. Parsing is now broken out
into a separate class (URLParser). It will now (by default) guess
whether a URL is already quoted, properly handle local files and
URLs on windows, and display un-quoted versions of the filename in
the progress meter.
2006-02-22 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
updated ChangeLog
2006-02-22 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/__init__.py:
release 2.9.8
2006-02-22 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/: grabber.py, mirror.py:
Added a reget progress bar patch from Menno, and fixed the annoying
_next IndexError bug. Thanks to Edinelson Keiji Shimokawa for
getting me looking in the right direction.
2005-10-22 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
updated ChangeLog
2005-10-22 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog, TODO, urlgrabber/__init__.py:
Updated TODO and preparing for release.
2005-10-22 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_grabber.py, test/test_keepalive.py,
test/test_mirror.py, test/support/testdata/test_post.php,
urlgrabber/byterange.py, urlgrabber/grabber.py,
urlgrabber/keepalive.py, urlgrabber/mirror.py:
Added some slick logging support to urlgrabber. It can be
controlled by the calling application or enabled from an
environment variable.
Allowed for HTTP POSTs by passing in the kwarg "data", just as for
urllib2.
2005-08-19 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/progress.py:
Another minor simplification offered by the WTF folks.
2005-08-18 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/progress.py:
Fixed a minor formatting bug, helpfully pointed out by the nice
folks at the daily WTF:
http://thedailywtf.com/forums/41204/ShowPost.aspx
2005-08-17 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_grabber.py, urlgrabber/grabber.py:
Added Menno's idea for catching KeyboardInterrupt. This allows the
program to do things like hit the next retry, jump to the next
mirror, etc. when the user hits ctrl-c.
Also modified the behavior of failure_callback slightly. It now
gets called for EVERY failure, even the last one. That is, if
there's a failure on the last try, the failure callback gets called
and (assuming it doesn't raise anything) the exception gets raised.
Previously, it would not be called on the last failure. This is
another MINOR compatibility break.
2005-08-17 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_grabber.py, test/test_mirror.py, urlgrabber/grabber.py:
except HTTPError, which basically means all HTTP return codes that
are not 200 and not handled internally (such as the 401
Unauthorized code). Previously, these were caught as IOErrors (of
which HTTPError is an indirect subclass) and re-raised as
URLGrabErrors with errno 4. They are NOW reraised as URLGrabErrors
with errno 14, and with the .code and .exception attributes set to
the HTTP status code and HTTPError exception object respectively.
This represents a minor compatibility break, as these "errors" are
now not handled the same way. They will be raised with a different
errno and errno 14 is not in the default list of retrycodes
(whereas 4 is).
2005-06-27 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog, urlgrabber/byterange.py:
Fixed two namespace-related bugs.
2005-05-19 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_keepalive.py, urlgrabber/keepalive.py:
started using sys.version_info rather than parsing sys.version
2005-04-04 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/keepalive.py:
Fixed a typo in that last change. Always remember to do "make
test" before committing :)
2005-04-04 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/keepalive.py:
Changed the version checking so SUSE's python "2.3+" version string
doesn't cause trouble.
2005-03-14 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_grabber.py, urlgrabber/grabber.py:
Added tests to check proxy format. Otherwise, missing proto could
lead to very obscure exceptions.
2005-03-08 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
updated ChangeLog
2005-03-08 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/__init__.py:
release 2.9.6
2005-03-08 Michael D. Stenner <mstenner@linux.duke.edu>
* makefile, setup.py:
Changed development status to Beta an fixed a small build bug. It
wasn't including MANIFEST and so wasn't building things properly
from the distributed tarballs.
2005-03-03 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
updated ChangeLog
2005-03-03 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/__init__.py:
release 2.9.5
2005-03-03 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/grabber.py:
Fixed a bug caused by the recent proxy fix for python 2.2. This
new bug came up when both the FTPHandler and FTPRangeHandler were
pushed in.
2005-02-28 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
updated ChangeLog
2005-02-28 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/__init__.py:
release 2.9.4
2005-02-28 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/grabber.py:
Added http_headers and ftp_headers options.
2005-02-25 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO, test/base_test_code.py, test/test_grabber.py,
test/support/squid/squid-setup, test/support/squid/squid.conf,
test/support/testdata/README, test/support/testdata/reference,
test/support/testdata/short_reference,
test/support/testdata/mirror/broken/broken.txt,
test/support/testdata/mirror/broken/reference,
test/support/testdata/mirror/broken/short_reference,
test/support/testdata/mirror/broken/test1.txt,
test/support/testdata/mirror/broken/test2.txt,
test/support/testdata/mirror/m1/broken.txt,
test/support/testdata/mirror/m1/reference,
test/support/testdata/mirror/m1/short_reference,
test/support/testdata/mirror/m1/test1.txt,
test/support/testdata/mirror/m1/test2.txt,
test/support/testdata/mirror/m2/broken.txt,
test/support/testdata/mirror/m2/reference,
test/support/testdata/mirror/m2/short_reference,
test/support/testdata/mirror/m2/test1.txt,
test/support/testdata/mirror/m2/test2.txt,
test/support/testdata/mirror/m3/broken.txt,
test/support/testdata/mirror/m3/reference,
test/support/testdata/mirror/m3/short_reference,
test/support/testdata/mirror/m3/test1.txt,
test/support/testdata/mirror/m3/test2.txt,
test/support/vsftpd/ftp-server-setup,
test/support/vsftpd/vsftpd.conf,
test/support/vsftpd/vsftpd.ftpusers,
test/support/vsftpd/vsftpd.user_list, urlgrabber/grabber.py:
Added data and instructions for setting up ftp and proxy servers
for testing and also added proxy tests. Fixed two bugs related to
proxies. One bug made it so the first time proxy data was passed
in, it was not actually used. The other prevented proxies from
working for ftp or non-keepalive http using python 2.2. The latter
is a bug in urllib2, but is now worked around (somewhat hackishly,
I admit) in urlgrabber. LocalWords: CVS
2005-02-22 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
updated ChangeLog
2005-02-22 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/__init__.py:
release 2.9.3
2005-02-21 Michael D. Stenner <mstenner@linux.duke.edu>
* README:
updated README with instructions for building rpms and notes about
python version compatibility.
2005-02-21 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/__init__.py:
updated project url
2005-02-21 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/grabber.py:
Fixed minor exception bug.
2005-02-14 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
updated ChangeLog
2005-02-14 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/__init__.py:
release 2.9.2
2005-02-14 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/grabber.py:
Added http character encoding patch from Chris Lumens.
2005-02-14 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_grabber.py:
slight cleanup - no new code
2005-02-14 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
updated ChangeLog
2005-02-14 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/__init__.py:
release 2.9.1
2005-02-14 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_keepalive.py, test/test_mirror.py,
urlgrabber/byterange.py, urlgrabber/grabber.py,
urlgrabber/keepalive.py:
Fixed python 2.4 bug - added .code attribute to returned file
objects. Changed keepalive.HANDLE_ERRORS behavior for the way 2.4
does things.
2005-02-08 Ryan Tomayko <rtomayko@naeblis.cx>
* makefile:
Added Python 2.4 to lists of pythons to test..
2005-02-04 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/byterange.py:
Fixed RH bug #147124. range_tuple_header was not accounting for a
possible None value.
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=147124
2005-01-14 Ryan Tomayko <rtomayko@naeblis.cx>
* test/test_grabber.py:
Fixed proftpd test (was raising error when it shouldn't have been)
2005-01-14 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/: grabber.py, progress.py:
Applied urlgrabber-add-text.patch from Terje Rosten
2004-12-12 Ryan Tomayko <rtomayko@naeblis.cx>
* test/base_test_code.py, test/test_grabber.py,
urlgrabber/byterange.py:
Fix for RH bug #140387
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=140387
2004-10-17 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/grabber.py:
Added except for socket resource error when reading from urllib2
file object.
2004-10-16 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/grabber.py:
Fixed AttributeError problem that was pretty common after additions
for timeout support.
2004-10-08 Ryan Tomayko <rtomayko@naeblis.cx>
* TODO, urlgrabber/grabber.py:
Added timeout support.
2004-09-07 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO, scripts/urlgrabber, urlgrabber/__init__.py,
urlgrabber/grabber.py, urlgrabber/mirror.py,
urlgrabber/progress.py:
Added Ryan's progress changes in. That still has some quirks I
need to iron out.
Made all urlgrabber imports relative to work more nicely with yum's
package layout. This was straightforward except for the
__version__ import, which I solved in a slightly icky way.
2004-08-25 Michael D. Stenner <mstenner@linux.duke.edu>
* MANIFEST.in:
Added "MANIFEST" to MANIFEST.in. Without this, one couldn't
correctly rebuild an rpm (for example) from the distributed
tarball.
2004-08-24 Ryan Tomayko <rtomayko@naeblis.cx>
* test/threading/batchgrabber.py:
Updates for Michael's multifile progress bar changes.
2004-08-21 Michael D. Stenner <mstenner@linux.duke.edu>
* makefile, urlgrabber/__init__.py:
Fixed a typo in the new 'release' target. Reformatted the doc in
__init__.py to be under 80 columns in PKG-INFO.
2004-08-21 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
updated ChangeLog
2004-08-21 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/__init__.py:
release 2.9.0
2004-08-21 Michael D. Stenner <mstenner@linux.duke.edu>
* makefile:
Added "release" and "daily" targets to makefile for automated
releases.
2004-08-20 Michael D. Stenner <mstenner@linux.duke.edu>
* test/: test_grabber.py, test_mirror.py:
Updated callback tests to accomodate the new callback semantics.
Also added many new checkfunc tests to for URLGrabber.
2004-08-20 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/: grabber.py, mirror.py:
Changed the callback syntax. This affects failure_callback use in
both URLGrabber and MirrorGroup instances. It also affects
checkfunc use in URLGrabber instances. If you use these features,
you WILL need to change your code.
2004-08-12 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_mirror.py:
enhanced the failover test to ensure that the bad mirror is tried
and fails
2004-08-12 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_grabber.py, urlgrabber/grabber.py:
fixed typo in _make_callback and added testcase
2004-08-11 Michael D. Stenner <mstenner@linux.duke.edu>
* makefile, test/test_grabber.py, urlgrabber/grabber.py:
Fixed a bug in URLGrabber._retry that prevented failure callbacks
from working. Also added a testcase for failure callbacks.
2004-08-09 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/mirror.py:
fixed documentation typos
2004-07-22 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO, urlgrabber/grabber.py, urlgrabber/progress.py:
Committing new progress meter code. This new version includes an
emulation of the old meter with pretty good backward compatibility.
However, if people are using custom progress meters, they might
break. This is a good general warning for the next few versions of
urlgrabber, actually. As the threading stuff evolves, the progress
meter may have to change its behavior a bit.
2004-07-21 Ryan Tomayko <rtomayko@naeblis.cx>
* .cvsignore:
Ignoring kdevelop, kate, and anjuta project files. Ignoring
directories created by distutils: build, dist
2004-07-21 Ryan Tomayko <rtomayko@naeblis.cx>
* TODO:
opener related items for ALPHA are completed. Moved stuff related
to getting a release out up into ALPHA section.
2004-07-21 Ryan Tomayko <rtomayko@naeblis.cx>
* setup.py, urlgrabber/__init__.py:
Added distutils trove classifiers to setup.py for eventual
inclusion in PyPI. Added general description string for urlgrabber
package (__init__.py).
2004-07-21 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/grabber.py:
Fixed bug with urllib2.OpenerDirector caching. Added cache_openers
option (pass to any urlXXX function/method) to control
OpenerDirector caching. The default is to cache openers.
2004-04-02 Michael D. Stenner <mstenner@linux.duke.edu>
* makefile, test/test_grabber.py:
Tidied up the grabber tests so we're not always skipping those ftp
reget tests (the just don't happen now). Also, modified the
makefile so that "make test" runs the testsuite with -v 1 for BOTH
python2.2 and 2.3 if they are available.
2004-03-31 Michael D. Stenner <mstenner@linux.duke.edu>
* LICENSE, maint/license-notice, scripts/urlgrabber,
test/grabberperf.py, test/runtests.py, test/test_byterange.py,
test/test_grabber.py, test/test_keepalive.py, test/test_mirror.py,
test/threading/batchgrabber.py, urlgrabber/byterange.py,
urlgrabber/grabber.py, urlgrabber/keepalive.py,
urlgrabber/mirror.py, urlgrabber/progress.py:
Replace LICENSE with the LGPL. Added maint/license-notice, which
contains the corresponding license-snippet that should be included
at the top of each file. Finally, added this snippet to all of the
.py files. urlgrabber is now fully LGPL-ed.
2004-03-31 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/grabber.py:
fixed minor typo in opener option docs
2004-03-31 Ryan Tomayko <rtomayko@naeblis.cx>
* test/runtests.py:
Added support for specifying verbosity of unit tests when using
runtests.py. 'python runtests.py --help' for usage.
2004-03-31 Ryan Tomayko <rtomayko@naeblis.cx>
* test/test_grabber.py, urlgrabber/grabber.py:
Added opener kwarg/option to allow overriding of
urllib2.OpenerDirector.
2004-03-31 Michael D. Stenner <mstenner@linux.duke.edu>
* test/munittest.py, test/test_keepalive.py,
urlgrabber/keepalive.py:
Fixed a few bugs related to python 2.3.
2004-03-29 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO:
updated TODO
2004-03-29 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog, makefile:
Modified "make ChangeLog" target so that it doesn't print the
times. If we use local times, then it varies with who rebuilds the
ChangeLog file. Also updated the ChangeLog to the new format.
2004-03-28 Michael D. Stenner <mstenner@linux.duke.edu>
* makefile:
Added "make test" target.
2004-03-28 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/grabber.py:
Fixed documentation error in the range docs. Temporarily disabled
the opener-caching because it seems to not quite work correctly
yet.
2004-03-28 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_keepalive.py:
Changed the wait time for the dropped-connection test. The apache
keepalive timeout is 15 seconds, so I set the test to 20.
2004-03-28 Ryan Tomayko <rtomayko@naeblis.cx>
* TODO:
Removed items from TODO list related to options handling as well as
the item on reget not working due to how options are copied.
2004-03-28 Ryan Tomayko <rtomayko@naeblis.cx>
* test/test_grabber.py, urlgrabber/grabber.py:
Modified URLGrabberOptions to use "instance-inheritance" pattern
instead of copying all options.
2004-03-28 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/mirror.py:
Fixed real small typo in mirror doc string.
2004-03-28 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/grabber.py:
Tidied up some of the callback code.
2004-03-21 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog, TODO, test/threading/batchgrabber.py,
test/threading/urls-keepalive, test/threading/urls-many:
Updated ChangeLog, TODO. Played with the batchgrabber stuff a
little.
2004-03-21 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_keepalive.py:
added a simple threading test
2004-03-21 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/keepalive.py:
Rearranged keepalive a little to make it nicer. Also made it deal
better with dead connections. Now it checks ALL "free"
connections.
2004-03-20 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO, urlgrabber/keepalive.py:
OK, I THINK I've made keepalive thread-friendly. The interface is
completely unchanged. It passes all the existing unittests
(although none of them test threading issues). I need to make some
new tests for thread stuff.
2004-03-19 Michael D. Stenner <mstenner@linux.duke.edu>
* test/grabberperf.py:
Made tests a little cleaner and added "none" test, which is a
reasonable "ideal" test case.
2004-03-19 Ryan Tomayko <rtomayko@naeblis.cx>
* TODO:
Removed item on reducing urllib2 opener director creation.
2004-03-19 Ryan Tomayko <rtomayko@naeblis.cx>
* test/grabberperf.py:
Added proxies to performance tests to ensure that ProxyHandlers are
being added to the urllib2 OpenerDirector.
2004-03-19 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/grabber.py:
Caching urllib2 ProxyHandler and OpenerDirectors instead of
creating new ones for each request.
2004-03-19 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO, urlgrabber/grabber.py:
In grabber.py, fixed a bug with urlread when no limit is specified.
Made the "natural" file translation work for relative pythnames.
2004-03-19 Michael D. Stenner <mstenner@linux.duke.edu>
* test/: base_test_code.py, test_grabber.py:
Added file and ftp tests for reget.
2004-03-18 Michael D. Stenner <mstenner@linux.duke.edu>
* test/: base_test_code.py, munittest.py, runtests.py,
test_byterange.py, test_grabber.py, test_keepalive.py,
test_mirror.py:
By popular demand, I broke out the unittest coolness into a
separate module (munittest.py) which is largely a drop-in
replacement for unittest.py. The whole test setup is just way cool
at this point.
2004-03-18 Michael D. Stenner <mstenner@linux.duke.edu>
* test/: base_test_code.py, runtests.py, test_byterange.py,
test_grabber.py, test_keepalive.py, test_mirror.py:
Made tests prettier (again... I do like pretty tests) and created
the test result "skip" intended for things like ftp tests where
there may not be a server available. None of the available "ok",
"FAIL" or "ERROR" were really appropriate.
2004-03-17 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/mirror.py:
Added a little more documentation.
2004-03-17 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO, test/test_mirror.py, urlgrabber/mirror.py:
Made the control of MirrorGroup failover action WAY cooler and more
flexible. Added tests
2004-03-16 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO, test/base_test_code.py, test/test_grabber.py,
urlgrabber/grabber.py:
Changed test base_url according to new web page. Added prefix
argument to URLGrabber, along with docs and test. Also documented
URLGrabber failure_callback.
2004-03-15 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO:
updated TODO after alpha-huddle
2004-03-15 Ryan Tomayko <rtomayko@naeblis.cx>
* test/test_grabber.py:
Grabber tests now call package level urlgrab, urlopen, urlread
functions.
2004-03-15 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/__init__.py:
Added urlgrab, urlopen, urlread as package level exports.
2004-03-14 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog, scripts/urlgrabber, urlgrabber/byterange.py,
urlgrabber/grabber.py, urlgrabber/keepalive.py,
urlgrabber/mirror.py:
Added Id: strings to distributed python files, made grabber.py use
the global version information from __init__.py, and updated
copyright statements to include 2004 :)
2004-03-14 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO:
updated TODO
2004-03-14 Michael D. Stenner <mstenner@linux.duke.edu>
* setup.py, urlgrabber/__init__.py:
Put version, date, url, and author information into __init__.py and
have setup.py read them in from there. Also put Id: tag into
__init__.py, which we should probably do in the other files as
well.
2004-03-13 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_mirror.py, urlgrabber/mirror.py:
improved mirror.py docs, added MGRandomStart (which is equivalent
to yum's roundrobin failover policy) and MGRandomOrder. Added
associated tests for these new classes.
2004-03-13 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_byterange.py:
converted test_byterange.py to use base_test_code
2004-03-13 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO:
updated TODO
2004-03-13 Michael D. Stenner <mstenner@linux.duke.edu>
* test/: base_test_code.py, test_grabber.py, test_keepalive.py:
making tests a little nicer
2004-03-13 Michael D. Stenner <mstenner@linux.duke.edu>
* test/: base_test_code.py, test_grabber.py, test_mirror.py:
added callback test to test_mirror.py. Restructuring the test code
a little. Starting to move common definitions (like the reference
data) into base_test_code.py, which can be accessed from the other
test modules. Also defined UGTestCase which prints things slightly
more prettily.
Also, I'd like to go on record as saying that I absolutely hate the
__attribute munging in python, and I don't think it should ever be
used. Thank you for your attention.
2004-03-13 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/: grabber.py, mirror.py:
added "failure_callback" options to both MirrorGroup and
URLGrabber. These allow the calling application to know when a
retry/failover is occurring and what caused it (the exception is
passed to the callback).
2004-03-13 Michael D. Stenner <mstenner@linux.duke.edu>
* test/: runtests.py, test_grabber.py, test_keepalive.py:
added keepalive tests; fixed test_bad_reget_type
2004-03-12 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/grabber.py:
Fixed a stupid typo in the splituser fix (from 5 minutes ago)
2004-03-12 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/.cvsignore:
Added .cvsignore to urlgrabber/ directory
2004-03-12 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO:
Updated TODO.
2004-03-12 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber/: grabber.py, mirror.py:
Fixed a typo in mirror.py. Added reget docs and reformatted
module-level docs in grabber.py. Also fixed a "bug" in grabber.py
that came about because the python 2.3 urllib2 no longer includes
splituser and splitpasswd.
2004-03-12 Ryan Tomayko <rtomayko@naeblis.cx>
* test/test_grabber.py:
Using a tempfile for urlgrab test instead of named file in cwd.
2004-03-12 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/grabber.py:
Removed ugly code that was suppose to allow a file-object to be
passed to urlgrab as the output file.
2004-03-12 Ryan Tomayko <rtomayko@naeblis.cx>
* test/runtests.py:
Added mirror tests to main test suite.
2004-03-12 Michael D. Stenner <mstenner@linux.duke.edu>
* test/test_grabber.py, urlgrabber/grabber.py:
Added first attempt at reget support. Created test suite for http.
2004-03-11 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog, MANIFEST.in, README, makefile, setup.py,
maint/cvs2cl.pl, maint/usermap:
Updated MANIFEST.in and README. Created 'maint' directory
containing cvs2cl.pl and usermap file (to make usernames to email
address for cvs2cl.pl). Also created a "ChangeLog" target in the
makefile to build the Changelog.
2004-03-11 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO, urlgrabber/keepalive.py:
[TODO] added not about restructuring import style
[keepalive.py] improved exception-handling and made it a bit more
fault-tolerant. Specifically, it now better addresses
dropped-connections. Also added a dropped-connection test to the
test code in keepalive.py. For now, I'd like to keep this code in
keepalive.py for convenient diagnostics from users.
2004-03-10 Michael D. Stenner <mstenner@linux.duke.edu>
* TODO, test/test_mirror.py, urlgrabber/grabber.py,
urlgrabber/keepalive.py, urlgrabber/mirror.py:
Added mirror code and associated test code.
[grabber.py] Edited URLGrabError doc string to reflect MirrorGroup
error code and new error code policy.
[TODO] Moved reget to ALPHA (from ALPHA 2)
[keepalive.py] fixed problem with the new python 2.3 httplib. They
now raise BadStatusLine from a new place.
2004-03-10 Ryan Tomayko <rtomayko@naeblis.cx>
* test/: grabberperf.py, test_byterange.py, test_grabber.py:
Fixed up tests to work with new directory layout. Note: most
sys.path hackery has been removed so you will need to setup
PYTHONPATH manually before running test scripts.
2004-03-10 Ryan Tomayko <rtomayko@naeblis.cx>
* setup.py:
Fixed up for new cvs structure. Tried to simplify a bit while I was
here.
2004-03-10 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
Edited the cvs repo to restructure a little bit. All module python
code (grabber.py, byterange.py, progress.py, keepalive.py, and
__init__.py) have been moved into a "urlgrabber" directory.
Removed the ChangeLog file from the repo and created a new empty
one. In the intial (empty) version, I included the old logs from
before the repo-merge. Therefore, those logs appear as a single
log-entry. It's a little bit of a hack, but it's good enough. All
the info is there, and it allows us to move forward without having
to keep special-casing those old logs.
All future changelogs will be generated with cvs2cs.pl.
Other than these actions, I didn't do anything to any files.
Therefore, a number of things are certainly broken, as they expect
things to be in other places. Things I can think of: test code,
setup.py.
2004-03-08 Ryan Tomayko <rtomayko@naeblis.cx>
* TODO:
raise exceptions for invalid ranges DONE.
2004-03-08 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/grabber.py:
Added support for RangeError to URLGrabError handling (errno 9).
Also, default is _not_ to retry grabs resulting in RangeError.
2004-03-08 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/byterange.py:
Fixed try/except without an explicit exception. that's a no-no.
2004-03-08 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/byterange.py:
raising RangeError anytime a range is non-satisfiable.
2004-03-08 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/grabber.py:
Always copy_local when a byte range is specified to urlgrab.
2004-03-08 Ryan Tomayko <rtomayko@naeblis.cx>
* TODO:
urllib2 conditionals were cleaned up a while ago..
2004-03-08 Ryan Tomayko <rtomayko@naeblis.cx>
* TODO:
Confirmed FTP connections are being closed properly when using
ranges. Added item for using CacheFTPHandler so FTP connections
are reused. Moved reget item under ALPHA 2. Added items for
keepalive/progress_meter w/ multiple threads. viewcvs DONE. Test
under multiple threads DONE. Basic performance tests DONE.
2004-03-06 Ryan Tomayko <rtomayko@naeblis.cx>
* test/threading/: urls-keepalive, urls-many:
Some URLs for batchgrabber.py.
2004-03-06 Ryan Tomayko <rtomayko@naeblis.cx>
* test/threading/batchgrabber.py:
Module for testing urlgrabber w/ multiple threads.
2004-03-01 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/grabber.py:
Performance tweak to bypass URLGrabberFileObject.read when not
using progress_obj or throttle.
2004-03-01 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/grabber.py:
Removed speed test from grabber.py. This test has been moved to
test/grabberperf.py
2004-03-01 Ryan Tomayko <rtomayko@naeblis.cx>
* test/grabberperf.py:
First cut at a module to run through some performance tests for
grabber.py
2004-03-01 Ryan Tomayko <rtomayko@naeblis.cx>
* test/test_grabber.py:
Clean up temp files after testing urlgrab.
2004-02-28 Ryan Tomayko <rtomayko@naeblis.cx>
* TODO:
Added reget and batch stuff under the 'maybe' section.
2004-02-22 Ryan Tomayko <rtomayko@naeblis.cx>
* urlgrabber/grabber.py:
Brought in fix from yum for user:pass escaping in URLs.
2004-02-22 Ryan Tomayko <rtomayko@naeblis.cx>
* TODO, urlgrabber/grabber.py:
Added checkfunc support to urlread.
2004-02-22 Ryan Tomayko <rtomayko@naeblis.cx>
* TODO, urlgrabber/byterange.py:
Changed all variables/args named 'range' to rangetup or brange to
avoid possible python keyword clash.
2004-02-22 Ryan Tomayko <rtomayko@naeblis.cx>
* TODO:
Added item about handling KeyboardInterupt exceptions.
2004-02-22 Ryan Tomayko <rtomayko@naeblis.cx>
* TODO:
Restructured TODO list based on recent email from Michael.
2004-02-14 Ryan Tomayko <rtomayko@naeblis.cx>
* .cvsignore, MANIFEST.in, README, TODO, progress_meter.py,
setup.py, urlgrabber.py, scripts/urlgrabber, test/.cvsignore,
test/runtests.py, test/test_byterange.py, test/test_grabber.py,
test/test_urlgrabber.py, urlgrabber/__init__.py,
urlgrabber/byterange.py, urlgrabber/grabber.py,
urlgrabber/progress.py:
Mass commit/merge of changes made for urlgrabber restructuring..
See ChangeLog for details.
2004-02-11 Michael D. Stenner <mstenner@linux.duke.edu>
* ChangeLog:
These are the changes from when ryan was working in a separate
repo. They are included here in bulk for ChangeLog simplicity.
2004-02-09 Friday rtomayko
* TODO: Initial commit of TODO list..
2004-01-29 Thursday rtomayko
* grabber.py, grabbertests.py, urlgrabber: Added proxy support.
* grabber.py: Removed default option copying in URLGrabber class
because this could cause side-effects.
2004-01-25 Sunday rtomayko
* setup.py: Fixed bad syntax on doc files..
* setup.py, urlgrabber: Added urlgrabber script to distutils
install.
2003-12-16 Tuesday rtomayko
* byterange.py, grabber.py: Initial crack at range support for
FTP - working but ugly..
2003-12-12 Friday rtomayko
* byterange.py, byterangetests.py: Make range tuples work like
python slices. endpos is non-inclusive.
2003-12-10 Wednesday rtomayko
* scripts/urlgrabber: Quick script to call urlgrabber from the
command line. I've been using it in place of wget to get as
much testing as possible.
* grabber.py: Fixed small problem with progress meter not
updating properly. This was caused by moving _do_grab to
URLGrabberFileObject.
* grabber.py: Moved _do_open and per-request variables into
URLGrabberFileObject
* grabber.py: Added doc for user_agent kwarg
* grabber.py: Moved urllib2 Handler configuration to
URLGrabberFileObject. Each request has a
urllib2.OpenerDirector now that is customized based on
options.
* grabber.py: Added keepalive kwarg. keepalive can be
specified on a per-request basis now.
* grabber.py: Changed method of handling retries. I'm not
sure if I like this yet. All URLGrabber.urlXXX methods
define a function and pass it to the URLGrabber._retry()
method which calls the function provided and handles
retrying if the provided function raises a URLGrabError.
2003-12-09 Tuesday rtomayko
* grabber.py: default_grabber is now a default instance of
URLGrabber used by the module level urlXXX functions.
* grabber.py: deprecated module level set_<option> methods.
default_grabber should be used instead.
* grabber.py: URLGrabber should be completely stateless now.
Using a single URLGrabber object for multiple requests
shouldn't be a problem.
* grabber.py: Added URLGrabberOptions class to handle kwargs
more elegantly.
2003-12-08 Monday rtomayko
* __init__.py, setup.py: everything should be in a package named
urlgrabber now (distutils is a nice piece of software).
* grabber.py, range.py, rangetests.py, grabbertests.py: - Initial
attempt at range support for HTTP and local files through range
module.
* byterange.py: brought in skeleton classes from urllib2 for FTP
range support but FTP ranges are not yet supported.
* byterange.py: Local file range support internally implemented
with RangeableFileObject which is a pretty cool file object
wrapper that automatically handles ranging.
2003-12-07 Sunday rtomayko
* grabber.py: minor changes for line wrapping
* grabber.py: added URLGrabber class (moved _do_open, _do_grab,
_parse_url, _get_raw_throttle into said class).
* grabber.py: deprecated retrygrab. now supported through
urlgrab(retry=n).
* grabber.py: local filenames (i.e. without URL scheme
specifier) are now handled wherever a url may be passed.
* grabber.py: retry is now fully supported on urlread and
partially supported on urlopen. wrt to urlopen, retries are
possible if an URLGrabError occurs while _opening_ the url
(e.g. due to temporarily unavailable, etc). however, we
still can't protect anyone from problems that occur after
urlopen has returned the file object.
2003-12-05 Friday rtomayko
* grabber.py: Initial attempt at retooling the interface to use
kwargs.. *Most* functionality should still be working..
2003-10-13 Michael D. Stenner <mstenner@linux.duke.edu>
* urlgrabber.py, test/test_urlgrabber.py:
created test_urlgrabber.py and started moving tests into it
2003-10-12 Michael D. Stenner <mstenner@linux.duke.edu>
* README, makefile, progress_meter.py, urlgrabber.py,
urlgrabber/keepalive.py, LICENSE, MANIFEST.in, setup.py:
Initial revision
2003-10-12 Michael D. Stenner <mstenner@linux.duke.edu>
* README, makefile, progress_meter.py, urlgrabber.py,
urlgrabber/keepalive.py, LICENSE, MANIFEST.in, setup.py:
importing urlgrabber