HTTrack Website Copier release history: -------------------------------------- This file lists all changes and fixes that have been made for HTTrack 3.49-5 + Fixed: MiniZip CVE-2023-45853 + Lintian fixes, multiple build fixes 3.49-4 + New: Push default bandwidth to 100kiB/s, max to 10MiB/s + Fixed: Multiple packaging and build fixes, added security flags 3.49-3 + Fixed: Corrections and updates of the Brazilian Portuguese translations (Paulo Neto) + Fixed: Fixed for Android + Fixed: Add mimetypes for documents (Pablo Castellano) + Fixed: Add firefox-developer-edition (sickcodes) + New: Support data-src and data-srcset for img (Brian 'Redbeard' Harrington) + Fixed: When using the '%N' structure naming key, separate the filename and extension with a dot (Steve Mokris) + Fixed: Multiple build updates 3.49-2 + Fixed: Buffer overflow in output option commandline argument (VL-ID 2068) (Hosein Askari) + Fixed: Minor fixes 3.48-23 + Fixed: on Linux, FTBFS with openssl 1.1.0 3.48-22 + Fixed: on Windows, fixed possible DLL local injection (Tunisian Cyber) + Fixed: various typos 3.48-21 + Fixed: Google RPMs use /usr/bin/google-chrome as program location (Cickumqt) + Fixed: Fixed htsserver not dying (immediately) on quit + New: Updated WIN32 OpenSSL to 1.0.1j (Evgeniy) + Fixed: webhttrack incompatibility with Chrome + Fixed: assertion failure at htslib.c:3458 (strlen(copyBuff) == qLen) seen on Linux + Fixed: infamous crashes inside the DNS cache due to a corruption within the option structure (E.Kalinowski/karbofos) + New: added minimalistic crash reporting on Windows and Linux + Fixed: URL list not working anymore (tom swift) + Fixed: FTBFS on ARM + Fixed: buggy FFFD (replacement character) in place of leading non-ascii character such as Chinese ones (aballboy) + Fixed: FTBFS when compiling with zlib versions < 1.2.70 (sammyx) + Fixed: buggy SVG (Smiling Spectre) + Fixed: do not uncompress .tgz advertised as "streamed" (Smiling Spectre) + Fixed: NULL pointer dereferencing in back_unserialize (htsback.c:976) + Fixed: library development files + Fixed: --advanced-maxlinks broken (Localhost) + Fixed: -devel package should now be standalone + Fixed: assertion failure at htscore.c:244 (len + liensbuf->string_buffer_size < liensbuf->string_buffer_capa) + Fixed: injection-proof templates + Fixed: htshash.c:330 assertion failure ("error invalidating hash entry") (Sergey) + Fixed: Windows 2000 regression (fantozzi.usenet) + Fixed: code cleanup (aliasing issues, const correctness, safe strings) + New: handle --advanced-maxlinks=0 to disable maximum link limits + New: updated ZIP routines (zlib 1.2.8) + Fixed: broken 32-bit version + Fixed: assertion "segOutputSize < segSize assertion fails at htscharset.c:993" + Fixed: new zlib version fixing CVE-2004-0797 and CVE-2005-2096 + Fixed: more reliable crash reporting + Fixed: fixed infamous "hashtable internal error: cuckoo/stash collision" errors + Fixed: safety cleanup in many strings operations + Fixed: buggy option pannels + New: Enforce check against CVE-2014-0160 + New: improved hashtables to speedup large mirrors + New: added unit tests + New: Added %a option, allowing to define the "Accept:" header line. + New: Added %X option, to define additional request header lines. + New: Added option '-%t', preserving the original file type (which may produce non-browseable file locally) + Fixed: remove scope id (% character) in dotted address resolution (especially for catchurl proxy) + Fixed: build fixes, including for Android, non-SSL releases + Fixed: buggy keep-alive handling, leading to waste connections + Fixed: removed chroot and setuid features (this is definitely not our business) + Fixed: removed MMS (Microsoft Media Server) ripping code (mmsrip) (dead protocol, unmaintained code, licensing issues) + Fixed: type mishandling when processing a redirect (such as a .PDF redirecting to another .PDF, with a text/html type tagged in the redirect message) + Fixed: infinite loop when attempting to download a file:/// directory on Unix (gp)
+ Fixed: removed background DNS resolution, prone to bugs + Fixed: do not choke on Windows 2000 because of missing SetDllDirectory() (Andy Hewitt) + Fixed: %h custom build structure parameter not taken in account (William Clark) 3.47-27 + New: support for IDNA / RFC 3492 (punycode) handling + New: openssl is no longer dynamically probed at stratup, but dynamically linked + Fixed: random closing of files/sockets, leading to "zip_zipWriteInFileInZip_failed" assertion, "bogus state" messages, or random garbage in downloaded files + Fixed: libssl.dylib is now in the search list for libssl on OSX (Nils Breunese) + Fixed: bogus charset because the meta http-equiv tag is placed too far in the html page + Fixed: incorrect \\machine\dir structure build on Windows (TomZ) + Fixed: do not force a file to have an extension unless it has a known type (such as html), or a possibly known type (if delayed checks are disabled) + Fixed: HTML 5 addition regarding "poster" attribute for the "video" tag (Jason Ronallo) + Fixed: memory leaks in proxytrack.c (Eric Searcy) + Fixed: correctly set the Z flag in hts-cache/new.txt file (Peter) + Fixed: parallel patch, typo regarding ICONV_LIBS (Sebastian Pipping) + Fixed: memory leak in hashtable, that may lead to excessive memory consumption + Fixed: on Windows, fixed possible DLL local injection (CVE-2010-5252) + Fixed: UTF-8 conversion bug on Linux that may lead to buggy filenames + Fixed: zero-length files not being properly handled (not saved on disk, not updated) (lugusto) + Fixed: serious bug that may lead to download several times the same file, and "Unexpected 412/416 error" errors + Fixed: images in CSS were sometimes not correctly detected (Martin) + Fixed: links within javascript events were sometimes not correctly detected (wquatan) + Fixed: webhttrack caused bus error on certain systems, such as Mac OSX, due to the stack size (Patrick Gundlach) + Fixed: bogus charset for requests when filenames have non-ascii characters (Steven Hsiao) + Fixed: bogus charset on disk when filenames have non-ascii characters (Steven Hsiao) + Fixed: fixed 260-characters path limit for Windows (lugusto) + Fixed: non-ascii characters encoding issue inside query string (lugusto) + Fixed: HTML entities not properly decoded inside URI and query string + Fixed: URL-encoding issue within URI + Fixed: --timeout alias did not work + Fixed: more windows-specific fixes regarding 260-character path limit + Fixed: escaping issue in top index + Fixed: Linux build cleanup (gentoo patches merge, lintian fixes et al.) + Fixed: Fixed div-by-zero when specifying more than 1000 connections per seconds (probably not very common) + Fixed: Mishandling of '+' in URLs introduced in 3.47-15 (sarclaudio) + Fixed: "Wildcard domains in cookies do not match" (alexei dot co at gmail dot com ) + Fixed: buggy referer while parsing: the referer of all links in the page is the current page being parsed, NOT the parent page. (alexei dot com at gmail dot com) + Fixed: Russian translation fixes by Oleg Komarov (komoleg at mail dot ru) + New: Added .torrent => application/x-bittorrent built-in MIME type (alexei dot co at gmail dot com) + Fixed: unable to download an URL whose filename embeds special characters such as # (lugusto) + New: Croatian translation by Dominko Aždajić (domazd at mail dot ru) + Fixed: url-escaping regression introduced in the previous subrelease + Fixed: content-disposition NOT taken in account (Stephan Matthiesen) + Fixed: buggy DNS cache leading to random crashes + Fixed: fixed logging not displaying robots.txt rules limits by default + Fixed: license year and GPL link, libtool fixes (cicku) + Fixed: Keywords field in desktop files (Sebastian Pipping) 3.46-1 * New: Unicode filenames handling * Fixed: fixed bug in handling of update/continue with erased files or renamed files, leading to "Unexpected 412/416 error (Requested Range Not Satisfiable)" and/or "Previous cache file not found" (-1)" errors * Fixed: escape characters >= 128 when sending GET/HEAD requests to avoid server errors * Fixed: do not use "delayed" extensions when the mirror is aborting * Fixed: generate error pages when needed (Brent Palmer) * Fixed: parsing issue with js files due to "script" tags (Vasiliy) * Fixed: anonymous FTP without password (Luiz) * Fixed: Makefile issues regarding parrallel build and examples (Sebastian Pipping) * Fixed: removed deprecated and annoying "Accept-Charset" header in requests (Piotr Engelking) (closes:#674053) 3.45-4 * New: source license is now GPLv3 * New: added a "K5" feature to handle transparent proxies (Brent Palmer) * New: option -y to control ^Z behavior (Julian H. Stacey) * Fixed: replace // by / when saving rather than _/ (Brent Palmer) * Fixed: do not interpret ^C before mirror is finished, or after * Fixed: webhttrack: do not use md5sum to produce a temporary filename, but mktemp (Ryan Schmidt) * Fixed: document %k for custom structure (full query string) 3.45-3 * Fixed: spurious "Previous file not found (erased by user ?)" messages leading to retransfer existing files in cache (Alain Desilets) * Fixed: --max-time now stops the mirror smoothly (Alain Desilets) 3.45-2 * Fixed: number of simultaneous connections was often only one * Fixed: "Unexpected 412/416 error" leading to have broken files on disk 3.45-1 * Fixed: interrupting an update/continue mirror session should not delete anymore previously downloaded content (William Roeder, Alain Desilets and many others) * Fixed: --continue/--update bug leading to download again already cached data in some cases (especially redirect/error pages) 3.44-5 * Fixed: crash when using -d with non-fully-qualified hostname (Alain Desilets) * Fixed: typo in logs (Pascal Boulerie) 3.44-4 * Fixed: random crash when interrupting the mirror (spotted by -fstack-protector) in htscoremain.c (closes:#657878) 3.44-3 + Fixed: Linux build (closes:#657334) 3.44-2 + Fixed: malformed format htslib.c (Moritz Muehlenhoff) + Fixed: default footer print format + New: clever "^C" handling + New: added --do-not-generate-errors option + New: increased maximum cookie name 3.44-1 + Fixed: Randomly corrupted files during updates due to "engine: warning: entry cleaned up, but no trace on heap"/"Unexpected 412/416 error" errors (Petr Gajdusek ; closes:#614966) 3.43-12 + Fixed: buffer overflow while repairing httrack cache if a damaged cache is found from a previous mirror (closes:#607704) 3.43-11 + Fixed: webhttrack fixes for icecat (closes:#605140) 3.43-10 + Fixed: capture URL not working properly when IPv6 is installed (John Bostelman) 3.43-9 + Fixed: application/xhtml+xml not seen as "html" (Peter Fritzsche) + Fixed: various linux fixes for desktop files (closes:#563691) 3.43-8 + Fixed: URL encoding bugs with filenames containing '%' characters (sandboxie32) + Fixed: MacPorts Darwin/Mac fixes to webhttrack (Ross Williams) + Fixed: Flash link extraction has been improved (Vincent/suei8423) 3.43-7 + Fixed: "Open error when decompressing" errors due to temporary file generation problems (Scott Mueller) 3.43-6 + Shell: WIN32 setup cosmetic fixes: do not probe the proxy on non-local network, do not force *.whtt registration 3.43-5 + Fixed: code tag handling bug in certain cases leading to produce invalid links (Tom-tmh13 and William Roeder) 3.43-4 + Fixed: horrible SSL slowdowns due to bogus select() calls (Patrick Pfeifer) + Fixed: Konqueror fixes 3.43-3 + Updated: Portugues-Brasil language file 3.43-2 + Fixed: wizard question buggy, and commandline version did not print it (Maz) + Fixed: do not rename xml subtypes (such as xsd schemas) (Eric Avrillon) 3.43 + Fixed: Fixed too aggressive javascript url= parsing (Chris) + Fixed: fixed --urllist option "sticking" the list content to the list of URL (Andreas Maier) + Fixed: "Previous cache file not found" not redownloading file when deleted before an update (William Roeder) + Fixed: *.rpm.src files renamed to *.src.src with bogus servers (Hippy Dave) + Fixed: "pause" is pausing much faster (William Roeder) + Fixed: binary real media files and related files are no longer being parsed as html (William Roeder) + Fixed: "File not parsed, looks like binary" check no longer corrupt the checked binary file + Fixed: multiple download of error pages (several identical '"Not Found" (404) at link [identical link]') leading to a slowdown in certain cases (William Roeder) + Fixed: sometimes, a double request was issued to update a broken file + Fixed: display bug "link is probably looping, type unknown, aborting .." + Fixed: missing library references at build time and other build related issues (Debarshi Ray) + Fixed: on windows, switched from wsock32.dll to ws2_32.dll + Fixed: minor argument size validation error for "-O" option (Joan CALVET) 3.42-3 + Fixed: Bad URL length validation in the commandline (CVE-2008-3429) (Joan CALVET) 3.42-2 + Fixed: Random crashes at the end of the mirror due to a dangling file pointer (Carlos, angus at quovadis.com.ar) 3.42 + Fixed: size limits are stopping the mirror gently, finishing pending transfers (David Stevenson) 3.41-3 + Fixed: text/plain advertised files renamed into .txt + Fixed: broken configure.in 3.41-2 + Fixed: major WIN32 inlined function bug caused the cache not to be used at all, causing update not to work 3.41 + New: changed API/ABI to thread-safe ones (libhttrack1 2), big cleanup in all .h definitions + Fixed: Major memory usage bug when downloading large sites + Fixed: do not rename files if the original MIME type was compatible + Fixed: several source fixes for freeBSD (especially time problems) + New: option %w to disable specific modules (java, flash..) + Fixed: 'no space left in stack for back_add' error + Fixed: fixed redirected images with "html" type + Fixed: 'Crash adding error, unexpected error found.. [4268]' error 3.40-2 + Fixed: bogus '.del' filenames with ISO-9660 option + Fixed: now merges the header charset even with an empty footer string + New: --port option for webhttrack 3.40 + New: mms:// streaming capture (thanks to Nicolas Benoit!) + New: proxyTrack project released + New: new experimental parser that no longer needs link testing ('testing link type..') + New: Redirect handled transparently with delayed type check and broken links made external when the "no error page" option is enabled + New: improved background download to handle large sites + New: '--assume foo/bar.cgi=text/html' is now possible + New: MIME type scan rules (such as -mime:video/* +mime:video/mpeg) + New: size scan rules now allows to rewrite uncaught links as external links + Fixed: crash fixed when ftime()/localtime()==NULL + Fixed: iso-9660 option now using '_' for collision character + Fixed: collision problems with CaSe SeNsItIvItY + Fixed: a href='..' fixed! + Fixed: redirects are now handled by the new experimental parser + Fixed: "./" links generated with fixed outbound links (../../) + Fixed: 'base href' bogus in many cases + Fixed: enforce security limits to avoid bandwidth abuses + Fixed: bogus external (swf) parser, fixed remaining .delayed files + New: new check-mime and save-file2 callbacks + New: "always delayed type check" enabled + Fixed: totally bogus finalizer causing compressed files not to be uncompressed, and many files to be truncated + Shell: new Finnish interface added! + Fixed: "..html" bogus type + Fixed: remaining bogus .delayed entries + Fixed: flush before user-defined command + Fixed: fixed user-defined command call and background cleaner + Fixed: fixed 'Crash adding error, unexpected error found.. [4250]' error + Fixed: fixed cache absolute file reference (the reference is now relative) preventing the cache form being moved to another place + Fixed: webhttrack 'Browse Sites' path bug + Fixed: old httrack cache format fixes (import of older versions did not work anymore) + Fixed: port fixes in htsnet.h + Fixed: -N option with advanced extraction (bogus "not found" member) + Fixed: javascript: location=URL was not recognized + Fixed: no more character escaping when not needed (such as UTF-8 codes) + Fixed: possibly temporary files left on disk with bogus servers giving compressed content on HEAD reuests + Fixed: URL hack caused unexpected filename collisions (index.html vs INDEX.HTML) + Fixed: "do not erase already downloaded file" option now correctly works (it leaves files linked in the mirror) + Fixed: UCS2 encoded pages are now converted properly into UTF-8 + New: "near" option now also catch embedded (images, css, ..) files + Fixed: bogus chunked multimedia link text files (such as x-ms-asf files) + Fixed: compilation problems on Un*x version 3.33 + Fixed: Bogus redirects with same location in https + Fixed: Bogus file naming with URL hack + Fixed: Extremly slow redirections and empty files + Fixed: Bogus names with directories ending with a "." + New: Number of connection per second can now be.. decimal, to delay even more + New: Enforce stronger ISO9660 compliance + Shell: "URL Hack" in interface + Shell: "Save settings" now rebuild categories + Shell: "Shutdown PC after mirror" option + Shell: Sound at the beginning/end or the mirror (configurable through system sound properties) + Shell: Fixed drag & drop, .url import + Shell: Fixed "wizard" mode (crash) + Fixed: Crash at the end due to unterminated pending threads + Fixed: \ is not anymore transformed into / after the query (?) delimiter + New: Two new callbacks for pre/post-processing html data + New: link-detected2 callback (additional tag name parameter) + Fixed: Broken ISO9660 + Fixed: Crash on file:// links + Fixed: Unescaped ampersands (&) in URLs + Fixed: Transfer hangs introduced in 3.33-beta-2 + Fixed: Display bug "Waiting for scheduled time.." + Fixed: Bug "Waiting for scheduled time.." (NOT a display bug, actually) + Fixed: CaSe SenSiTiViTy bugs with mutliple links reffering to the same URL but using different case styles + Fixed: Failed to build from sources (FTBFS) on amd64 archs because of cast problems (Andreas Jochens) + Fixed: & were converted into   (Leto Kauler) + Shell: Fixed crash with long URL lists (Libor Striz) + Fixed: connection/seconds limiter replugged + Fixed: "no files updated" display bug + Fixed: bogus links encoded with UTF (Lukasz Wozniak) + New: --assume can be used to force a specific script type (Brian Schröder) 3.32 + Fixed: css and js files were not parsed! + Fixed: again broken file:// (infinite loops with local crawls) + Fixed: Bandwidth limiter more gentle with low transfer rate + Fixed: external wrappers were not called during updates/continue + New: additional callback examples + Fixed: overflow in unzip.c fixed + New: tests are now cached for better performances! + New: %r (protocol) option for user-defined structure + Fixed: Broken engine on 64-bit archs 3.31 + New: Experimental categories implemented + New: New cache format (ZIP file) + New: .m3u files now crawled + New: .aam files now crawled + Fixed: Broken ftp + Fixed: Broken file:// + Fixed: Broken cookies management and loading + Fixed: HTTrackInterface.c:251 crash + Fixed: "N connections" means "N connections" even in scan phase + Fixed: javascript:location bug + Fixed: libtool versioning problem fixed + Fixed: More javascript bugs with \' and \" + Fixed: .HTM files not renamed into .html anymore + Fixed: OSX fixes in the Makefile script + New: Default "referer" and "from" fields + New: Full HTTP headers are now stored in cache + Fixed: ftp transfer not logged/properly finalized + Fixed: Missing symbolic link in webhttrack install + Fixed: path and language not saved in webhttrack + Shell: Avoid invalid project names + Fixed: Javascript bug with src= + Fixed: Keep-alive consistency problems on Linux/Unix with bogus servers (SIGPIPE) + Fixed: Parsing bug inside javascript (bogus parsing with empty quotes in function: foo('')) + Fixed: static compiling on Linux/Unix + Fixed: bloated .h headers (internal function definitions) + Fixed: Bogus query strings with embedded ../ and/or ./ + New: Added "change-options" call in the crawl beginning + New: Query arguments now sorted for normalized URL checks (when "url hack" option is activated) + Fixed: Previous dependency to zlib.dll to zlib1.dll + Fixed: Broken static files were not correctly updated with the new cache format + Shell: Launch button in Internet Explorer + Fixed: Crash when dealing with multiple '?' in query string with 3.31-alpha 3.30 + New: Webhttrack, a linux/unix/bsd Web GUI for httrack + New: "URL hack" feature + New: HTTP-headers charset is now propagated in the html file + New: loadable external engine callbacks + New: Experimental ".mht" archives format + Fixed: Query ?? bug + Fixed: Bogus base href without http:// + Fixed: Several javascript bugs + Fixed: UCS2 pages badly detected + Fixed: Build structure change does not redownload files + Fixed: "?foo" URL bug (link with only a query string) fixed + Fixed: ' or " inside non-quoted URLs + Fixed: keep-alive problems with bogus servers + Fixed: Broken .ra files + Fixed: More javascript bugs + Fixed: ftp transfers not properly monitored in the shell + Fixed: various fixes in webhttrack + Fixed: Blank final page in webhttrack + Fixed: Javascript comments (//) are skipped + Fixed: Temporary fix for "archive" bug with multiple java archives + Fixed: Inlined js or css files have their path relative to the parent + Fixed: Unescaped quotes ("") when continuing/updating in commandline mode + Fixed: Null-character in html page bug + Fixed: External depth slightly less bogus + Fixed: Filters based on size bogus ("less than 1KiB" is now functionning) + Fixed: Strange behaviour with filters (last filter "crushed") + Fixed: Bogus downloads when using ftp (unable to save file) + Fixed: Freeze with keep-alive on certain sites due to bad chunk encoding handling + Fixed: Problems with javascript included paths + Fixed: The mirror now aborts when the filesystem is full + Fixed: "No external pages" option fixed + Fixed: Javascript and \" in document.write bug fixed + Fixed: Two memory leaks in temporary file generation, and in link build fixed + Fixed: Bogus compression with non-gzip format + Fixed: Larger range of charsets accepted + Fixed: Bogus robots.txt when using comments (#) + Fixed: Missing MIME types for files such as .ico + Shell: Fixed continuous proxy search + Shell: Fixed missing HelpHtml/ link + Fixed: Overflow in htsback.c:2779 + Fixed: Bogus style and script expressions due to too aggressive parsing + Fixed: Javascript parsing bugs with \" and \' + Fixed: Javascript link detection bugs when comments were inserted between arguments + Fixed: Bug when valid empty gzip content was received + New: More aggressive "maximum mirroring time" and "maximum amount of bytes transfered" options + New: Windows file://server/path syntax handled + Fixed: mht archive fixes + Fixed: Serious bugs with filters given in commandline erased by the engine + Fixed: Bogus parsing of javascript: generated inside document.write() inside javascript code removed 3.23 + New: Keep-alive + New: URLs size limit is now 1024 bytes + New: Bogus UCS2 html files hack + Fixed: base href bugs + Fixed: windows "dos devices" bug fixed + Fixed: dirty parsing now avoids "," + Fixed: "get non-html files near a link" option sometimes caused huge mirrors + Fixed: Bugs if zlib library is not found + Fixed: Bug with "near" and "no external pages" + Fixed: "Link empty" crash + Fixed: Several javascript bugs + Fixed: Keep-alive problems ("unknown response structure") + Fixed: Major keep-alive bug (connection not closed) + Fixed: 8-3 options not working, ISO9660 option improved + Fixed: Bogus links with embedded CR, TAB.. + Fixed: small ../ external link bug fixed 3.22-3 + Fixed: Slow engine due to (too strict) memory checks + Fixed: Overflow in htscore.c:2353 + Fixed: Bogus chunked files with content-length fixed + Fixed: Folders renamed into ".txt" on Un*x platforms bug fixed! + New: Scan rule list (-%S) added + New: Cache debugging tool (-#C) added 3.21-8 + New: Basic Macromedia Flash support (links extraction) + New: Modular design for https, flash parser and zlib + New: Standard autoconf/configure design on Un*x platforms + New: Modular design also on Windows platforms (dll/lib) + Fixed: Text files without extension not renamed "html" anymore + Fixed: Bug with "?foo" urls + Fixed: No chmod 755 on home anymore + Fixed: Stability problems due to bad file structure checks + Fixed: Overflow in GUI/commandline when displaying statistics + Fixed: Directory creation error 3.20-2 + New: HTTPS support (SSL) + New: ipv6 support + New: 'longdesc' added + New: new file 'new.txt' generated for transfer status reports + New: ISO9660 compatibility option + New: empty mirror/update detection improved + New: Update hack now recognizes "imported" files + New: Option to disable ipv4/ipv6 + New: Filters now recognize patterns like -https://* + Fixed: The engine should be now fully reentrant + Fixed: Fixes for alpha and other 64-bit systems + Fixed: Files downloaded twice if not found in cache + Fixed: ftp problems with 2xx responses + Fixed: ftp problems with multiple lines responses + Fixed: ftp %20 not escaped anymore + Fixed: ftp RETR with quotes problems + Fixed: now tolerent to empty header responses + Fixed: hts-log closed + Fixed: Compressed pages during updates + Fixed: Crash when receiving empty compressed pages + Fixed: Random crashes in 'spider' mode + Fixed: bcopy/bzero not used anymore.. + Fixed: various code cleanups + Fixed: Better UTF8 detection + Fixed: External links now work with https and ftp + Fixed: Top index.html corrupted or missing + Fixed: URL list crashes + Fixed: Random crashes with large sites due to bogus naming handler + Fixed: Freezes on some robots.txt files + Fixed: Compressed files not stored + Fixed: SVG fixes + Fixed: Raw HTML responses + Fixed: 406 error workaround + Fixed: Crashes due to binary files with bogus HTML type (not parsed anymore) + Fixed: External https and ftp links broken, relative https links broken + Fixed: Automatic resizing of filter stack + Fixed: Various ampersand (&) elements added + Fixed: https with proxy temporary workaround (direct connection) + Fixed: "base href" with absolute uris + Fixed: stack frame too large on some systems + Fixed: random bad requests due to bogus authentication + Shell: Several fixes, including registration type problems + Shell: "template files not found" fixed 3.16-2 + Fixed: Zlib v1.4 + Fixed: Gzipped files now downloaded without problems (HTTP compression bug) + Fixed: Ending spaces in URLs now handled correctly + Fixed: META-HTTP bug + Shell: Type registration done only once 3.15 + Fixed: Bogus HTTP-referer with protected sites + Fixed: Fatal IO/socket error with large sites (handles not closed) + Fixed: K4 option now works + Fixed: --continue+URL(s) now clears previous URLs + Fixed: Parsing bug with 'www.foo.com?query' + Shell: 'Soft cancel' documented + Shell: 'Kx' options added 3.10 + Fixed: Broken pipes on Linux version + Fixed: Commandline version bug with gzipped files + Fixed: Crash when reaching compressed error pages + Fixed: Bogus html-escaped characters in query strings + Fixed: Files skipped (bogus anticipating system) + Fixed: Crash when showing stats (div by zero) + Fixed: Problems with URLs/redirects containing spaces or quotes + Fixed: Slash added when ~ detected + Fixed: Ugly VT terminal + New: Faster and cleaner mirror interrupt 3.09 + Fixed: Several problems with javascript parsing + Fixed: Elements after onXXX not parsed + New: Source update wrapper + New: Style url() and @import parsed + Shell: Word database and maximum number of links + Shell: Option changes taken in account immediately + Shell: Cleaner installer (registry keys) 3.08 + New: HTTP compression is now supported + New: Faster response analysis + Fixed: External page in html if cgi + Fixed: Mix between CR and CR/LF for comments + Fixed: Top index corrupted + Shell: Better refresh during parsing + Shell: DLL error 3.07 + Fixed: Random crashes with HTTP redirects + New: New rate limiter (should be sharper) + New: Code cleaned up, new htscore.c/httrack.c files 3.06 + Fixed: Redirect to https/mailto now supported + New: Top index/top dir for Un*x version + New: Sources more modular (.so) + New: Quicktime targetX= tags + New: HTTP 100 partially supported 3.05 + Fixed: Non-scannable tag parameters ("id","name",..) + Fixed: Java classes not found when using "." as separator + Fixed: Java classes not found when missing .class 3.04 + Fixed: URLs with starting spaces + Fixed: bogus URLs when using "base href" + Shell: --assume and -%e options included + New: Documentation updated a little 3.03 + New: Parser optimizations, 10 times faster now! + New: New --assume option to speed up cgi tests + New: Option to avoid Username/password storage for external pages + New: Query string kept for local URIs + Fixed: RFC2396 compliant URLs accepted (//foo and http:foo) + Fixed: foo@foo.com not considered as URL anymore + Fixed: Space encoded into %20 in URIs + Fixed: "Unable to save file" bug + Fixed: Corrupted top index.html + Fixed: Cookies disabled with --get + Fixed: Cache bug for error pages 3.02 + Fixed: Pages without title recorded in top index + Fixed: Error with Content-type+Content-disposition + Fixed: backblue.gif/external.html files not purged anymore + Fixed: Encoding problems with files containing %2F or other characters + Fixed: Write error reported for HTML files + New: hts-stop.lock file to pause the engine + New: New install system using InnoSetup 3.01 + New: HTTP real media files captured + Fixed: Bogus statitics + Fixed: Minor fixes 3.00 + New: New interface, with MANY improvements! + New: Better parsing (enhanced javascript parsing, sharper HTML parsing) + New: Faster and more efficient background download system + New: ETag properly handled + New: Optional URL list + New: Optionnal config file + New: New structure options + New: New filters options (size filters) + New: Better password site handling + New: Traffic control to avoid server overload + New: Setuid and Chroot for Unix release + New: limited 64-bit handling + New: .js files are now parsed + New: Single hts-log.txt file, error level + New: New top index.html design + New: "Update hack" option to prevent unnecessary updates + New: Default language sent for mirrors + New: Searchable index + Fixed: Bogus ftp routines (Linux version) + Fixed: Bug that caused to mirror a complete site from a subdir + Fixed: Bug that caused restart to be very slow + Fixed: Bug that caused loops on several query-string pages (?foo=/) + Fixed: Corrupted cache bug + Fixed: Random broken links (pages not downloaded) + Fixed: Shared links problems + Fixed: Bogus URLs with commas (,) + Fixed: Bogus / and \ mixed + Fixed: Bogus addresses with multiple @ + Fixed: Bogus links with %2E and %2F + Fixed: Bogus empty links + Fixed: "Unexpected backing error" bug fixed + Fixed: Files with incorrect size no more accepted + Fixed: Top index.html created even for untitled pages + Fixed: Bogus N100 option (unable to save file) + Fixed: Deadlock when using many hosts in URLs + Fixed: Password stored internally to avoid access errors + Fixed: Fixed /nul DOS limit + Fixed: Bogus -* filter (nothing mirrored) + Fixed: .shtml now renamed into .html + Fixed: Content-disposition without "" + Fixed: External html page for /foo links + Fixed: Username/password % compliant + Fixed: Javascript parser sometimes failed with " and ' mixed + Fixed: Some Range: bugs when regeting complete files + Fixed: Range: problems with html files + Fixed: HTTP/1.1 407 and 416 messages now handled + Fixed: Bogus timestamp + Fixed: Null chars in HTML bug + Fixed: Error pages cache bug + Fixed: Connect error/site moved do not delete everything anymore! + Fixed: Bogus garbage ../ in relative URL + Shell: New transfer rate estimation + Shell: Fixed crash when using verbose wizard + Shell: dynamic lang.h for easier translation updates + Shell: Fixed some options not passed to the engine + Fixed: A lots of minor fixes! 2.2 Note: 3.00 alpha major bug fixes are included in the 2.2 2.02 + New: Cache system improved, compatible with all platforms + New: Update process improved (accurate date) + New: Remote timestamp for files + New: ETag (HTTP/1.1) supported + Shell: Portugese interface available + Fixed: Bug with links containing commas + Fixed: 'file://' bug with proxy + New: Engine a little bit faster + Shell: Some bugs fixed in the interface 2.01 + New: ftp through proxy finally supported! + New: Sources cleaned up + New: Again some new marvelous options + New: Speed improved (links caught during parsing, faster "fast update") + New: Tool to catch "submit" URL (forms or complex javascript links) + Shell: German interface available + Shell: Dutch interface available + Shell: Polish interface available + Fixed: Level 1 bug fixed + Fixed: Still some parsing/structure problems + Fixed: Referer now sent to server + Fixed: Cookies did not work properly + Fixed: Problems with redirect pages + New: Better javascript parsing + Fixed: Problems with URL-parameters (foo.cgi?param=2&choice=1) + Fixed: Problems with ftp + New: ftp transfers are now in passive mode (firewall compliant) 2.00 -- The First Free Software Release of HTTrack! + New: HTTrack sources (command line), now free software, are given + Shell: Interface rewritten! + New: Documentation rewritten + Shell: Drag&Drop abilities + Shell: More URL informations + Shell: Fixed: Remote access problems + Fixed: Loop problems on some sites causing crashes + Fixed: URL encoding problems + Fixed: Some file access problems for ../ + Fixed: Some fixes for updating a mirror + Shell: Crazy progress bar fixed + Fixed: Form action are rewritten so that cgi on form can work from an offline mirror + Fixed: Crashes after continuing an "hand-interrupted" mirror + Fixed: Bogus files with some servers (chunk bug) 1.30 + Shell: Interface improved + New: robots.txt are followed by default + New: Parsing speed improved on big (>10,000 links) sites with an hash table + New: Mirror Link mode (mirror all links in a page) + New: Cookies are now understood + New: No external pages option (replace external html/gif by default files) + New: Command line version improved, background on Unix releases + Fixed: Problems with javascript parsing + Fixed: Username/password not set to lowercase anymore + Fixed: Problems with base href + New: Links in level 1 html files now patched + New: Expurge now deletes unused folders + New: Option -V executes shell command for every new file + Shell: Primary filter now works 1.24 + Fixed: Ftp protocol bogus (with login/pass) + Fixed: Cache problems (corrupted files) + New: Expurge old files for updates + New: "Updated" messages for mirror updates + Shell: Autodial/hangup option to RAS + Fixed: index.html were not created sometimes + Shell: Fixed: Random crashes with the interface + Shell: Fixed: Filters profile not saved + Fixed: Various (and numerous) fixes 1.23 + Shell: Interface improved + Shell: Multiple setups + Shell: Redefine options + Shell: Continue interrupted mirror improved 1.22 + Fixed: Parsing up/down did not work well + Fixed: Several files not catched, bugs fixes + Fixed: Problems with classes (1.21) + New: Transfer rate can be limited (-A option) + Shell: Smooth refresh + New: ftp basic protocol a little bit improved 1.21 + Fixed: Several java classes were not parsed + Fixed: Some folders without ending / ignored + Fixed: Crashes due to content-type too long 1.20 + Shell: documentation! + Fixed: Some problems with 'host cancel' system after timeouts (crashes) + New: Get only end of files if possible (file partially get) + New: New cache system (only HTML stored) + New: User-defined structure possible + New: Also available: french interface + Fixed: Random crashes (div by 0/illegal instruction) with null size files + New: Limited ftp protocol (files only), e.g. +ftp://* now works + Fixed: Some connect problems with several servers or proxies + New: New option, save html error report by default + Shell: Browse and see log files at the end of a mirror + New: Proxy authentication (ex: guest:star@myproxy.com:8080) + Shell: Interface improved (especially during mirror) + Fixed: Ambiguous files are renamed (asp,cgi->html/gif..) + Shell: New test link mode option + New: Site authentication (ex: guest:star@www.myweb.com/index.html) + Fixed: Minor bugs fixed + Shell: See log files during a mirror + Fixed: Some problems using CGI (different names now) + Fixed: Go down/up/both options and filters + Fixed: "Store html first" did not work + New: -F option ("Browser ID") disguise HTTrack into a browser + New: New filter system + Shell: New "Save as default" options + Fixed: "Build options" did NOT work properly! (files overwritten or missing) + Fixed: User agent ID fixed + Shell: Skip options + Shell: Better interface control during mirrors + Shell: InstallShield and Help files + Fixed: Some external links were not filtered sometimes + Fixed: Mirror crash at the end 1.16b + Shell: Really *stupid* bug fixed causing WinHTTrack to be slooow + Fixed: Crash if the first page has no title fixed + Fixed: Bogus options like "Just scan" saved empty files + Fixed: Forbid all links (*) with manual accept did not work + Shell: Filters interface improved 1.16: + New : Java Classes and subclasses are now retrieved! + New: Better JavaScripts parsing + New: Option: Abandon slowest hosts if timeout/transfer too slow + Shell: Interface improved 1.15b + Fixed: Some bugs fixed 1.15: + Shell: Interface improved + New: Robot improved (some files through javascript are now detected!) + New: Improved wild cards (for example, +www.*.com/*.zip) + New: 'config' file to configurate proxy, path.. only once 1.11 + New: Wait for specific time (begin transfer at specific hour) + New: Time limit option (stops transfer after x seconds) + Shell: Interface improved for an easy use 1.10e + Fixed: Maps were not correctly managed (stupid bug) 1.10d: + Fixed: Bogus index.html fixed 1.10c + Shell: "Time out" field needed "transfer rate" field 1.10b + Fixed: Better memory management 1.10 + New: "Transfer rate out" option added (abandon slowests sites) + New: "Deaf" hosts do not freeze HTTrack any more + Fixed: Again problems with code/codebase tags + New: Broken links detection improved 1.04 + Fixed:Some links were not correctly read (pages with "codebase" tags) + Shell: Interface improved 1.03 (No changes for the command-line robot) + Shell: Big bug fixed! (VERY slow transfer rates..) 1.02 + Fixed: Some java files were not correctly transfered + New: Speed has been improved + Fixed: Log file more accurate + Shell: Interface has been improved 1.01 + Fixed: Structure check error in some cases 1.00 -- The 1.00, Yeah! + New: base and codebase are now scanned 0.998 beta-2 + Fixed: Multiple name bug (files having the same name in the same directory) with -O option fixed 0.997 beta-2 + Fixed: Filenames with '%' were not correctly named + Fixed: Bug detected in 0.996: several files are not written on disk!! 0.996 beta-2 + New: -O option (path for mirror and log) + New: Unmodified file time/date are not changed during an update 0.99 beta-2 + New: User-agent field + New: Shortcuts (--spider etc.) + New: Links not retrieved are now rebuilt absolutly + New: The 'g' option (juste get files in current directory) has been added + New: Primary links analyste has been improved + Fixed: "304" bug fixed 0.25 beta-2 + Fixed: Freeze during several mirrors fixed! + New: More 'N' options (filenames type) 0.24 beta-2 + Fixed: Restart/Update with cache did not work (really not..) + Fixed: Wild cards now work properly (e.g. +www.abc.com* do works) + New: The 'n' option (get non-html files near a link) has been added! 0.23 beta-2 + Fixed: The 'M' option (site size) did not work + Fixed: Files larger than 65Kb were not correctly written older beta + Many, many bugs fixed