2 Replaced charset tables with new ones, published by Unicode Consortium with
3 more permissive license.
4 Fixed some incompatibilities with CLang
5 Fixed lot of segfaults on incorrect or corrupted data
6 Use stdint int types throughout the code, add configure check for stdint.h
9 Now ole Root Entry is searched by entry type rather than by
10 name. It is better fix for same problem which was fixed in
11 0.94.1. Fixed building of catppt postscript docs. It might be
12 not qualify for release, but since I've lost archives and have
13 no hope of recreating distribution with same SHA1 sum from
14 working copy of repository which was left, I prefer to make new
15 release. 0.94.1 Feb 09 2006 Fixed some problems with OLE
16 parsing. Fixed some issues with codepage search introduced in
17 the 0.94. Fixed negative record length in ppt files.
20 Added catppt utility to process powerpoint files. Now
21 xls2csv proprerly recognizes Mac and other codepages which
22 Microsoft refers by numbers and other world by names. Support
23 ansicpg command in RTF. Also lot of small bug fixes 0.93.4 Sep
25 0.93.4 Sep 30 2004. Wedding release.
26 After about a year of existence of version 0.93.3 numerous
27 bug reports was send to me and incorporated into CVS. So it is
28 time to release new one. Fixed lot of bugs concerning rtf
29 parser and xls3csv. Added ability to define customizable page
30 separator for multipage spreadsheets and command line switch to
31 specify desired maximal precission of floating point numbers
32 (default now is output as many digits as it is). Also fixed bug
33 with reading pre-OLE word/write files and text files (Debian
37 It was planned as feature release. It has support for Excel
38 Date formatting, output of blank cells and help window in
39 wordview. Unfortunately, during its development important bug
40 was found in ole parser code. So I have to publish this release
41 real soon after previous 0.93.2 Nov 14 2003
43 Improved performance of OLE parser, fixed problems with
44 unicode chars 0xFF00-0xFFFF in catdoc, rewrite wordview for
45 unicode-aware version of Tcl, with support of displaying text
46 in language different from current locale. Reworked autoconf
50 Fixed numerous bugs in newer OLE and RTF code, including
51 problem with incorrectly interpreting last (incomplete)
52 256-byte block of text as Unicode. Restored support for pre-OLE
53 Word versions, which was accidently lost in 0.93
56 Added proper handling of OLE structure (by Alex Ott).
59 Added RTF parser at last (contributed by Alex Ott). MS-DOS
60 Executable for xls2csv is included. Some code clean up and
64 Added autodetect of output charset from current locale.
65 Fixed handling of RK and MULRK records in xls2csv. No more
66 missing numbers. Fixed long-standing bug with loosing of first
67 8 symbols when recoding text file. I finally began to provide
68 MS-DOS executables for 0.91.x series
70 0.91.5 January 30 2002
71 I finally got to catdoc again. UTF-8 output is added. Just
72 specify utf-8 as output charset.
74 0.91.4 December 30 1999
75 Fixed important bug in xls2csv - improper recognition of
76 numeric cells (as opposed to formula). Fixed segfault when
77 catdoc is used to recode plain text files.
79 0.91.3 December 14 1999
80 Mainly xls2csv fixes - xls2csv now recognizes some options
81 (man page is in sync), added endianess check to configure, so
82 xls2csv compilies correctly out of the box on big-endian
85 0.91.2 October 19 1999
86 This is first verison which includes xls2csv program. Also,
87 some long-standed bugs are fixed and newly-introduced bug when
88 catdoc hangs on broken files. Although these files are not read
89 properly without -b switch. New charset koi8-u is added to
90 distribution. If you want to use it in the stable version, just
91 download it from here and put in the catdoc library directory.
93 New switch -l is added. It causes catdoc to list available
94 charsets in current charset path.
96 0.91.1 October 15 1999
97 As it was expected it was wrong decision to believe
98 information about extended charset from word document header.
99 Now we analyze encoding for each 256-byte page separately
100 (becouse it is possible that first ones would be 8-bit and
101 other 16-bit). When processing non-word files (i.e. plain text)
102 encodings are converted and -u is taken into account, so catdoc
103 can be used as generic character converter, which supports utf8
104 and utf16 (both byteorders) as input. 0.91.0 October 12 1999
106 Implemented new format analyzis. Now most versions of word
107 format as well as MS-Write and rtf are detected. Boundaries of
108 main text stream are also detected, so no more garbage is
109 produced at the end of file 0.90.3 August 11 1999
111 Fixed small OS-specific bugs - broken isspace in Turbo C
112 Under DOS and %x was replaced %i for compatibility with SunOS 4.
115 Artem Chuprina pointed out to segfault error when
116 non-existent charset is specified in command line. It turned
117 out to be silly bug in check_charset function with oneline fix.
118 You can get one-line patch.
121 Top-level Makefile now uses $MAKE instead of make
122 fixed missing end-line escaping in wordview.tcl
123 All occurences of strcpy, strcat and sprinf investigated
124 to avoid buffer overflows.
126 Fixed bug with charset names redeclared locally in main()
127 Fixed problem in configure with wish 8.0.3
128 Catdoc considered to be stable enough for release
130 Fixed handling of 0x1F char (soft hyphen in Word 6.0),
131 now it is translated to 0x00AD (unicode soft hyphen)
132 Fixed permissions for manual page
133 Added --with-install-root configure arg to simplify
134 building of binary packages.
135 0.90b4 September 17 1998
136 Added proper configuration of library dir in wordview.
137 Added --disable-charset-check config option
138 Added 0x2026 symbol in ascii.rpl
139 Added more Windows codepages in distribution
140 0.90b3 September 11 1998
141 Added -x switch to simplify debugging of substitution maps
142 0.90b2 September 10 1998
143 Added some symbols is 0x2000-0x20FF range to substituton maps
144 These symbols occurs in cp1251 so they are frequently found
145 in Word files. Fixed some filename-handling problems in
148 0.90b1 September 8 1998
149 Added us-ascii.charset, fixed small bugs in confugre,
150 install is used for all installation files. Code is
151 considered stable enough to be beta.
153 0.90a3 September 7 1998
154 Fixed small bug in table handling, which caused catdoc to
155 output extra column delimiter just before row delimiter. Added
156 autoconf configuration. install is back, although not for
159 0.90a2 August 18 1998
160 version 0.90 was tested on BSDI and Solaris platform. Makefile
161 was rewritten to avoid use of highly incompatible
162 /usr/{ucb,bin}/install
164 0.90a1 August 13 1998
165 Catdoc undergone major rewrite. Now it has proper charset
166 handling, including UNICODE and runtime configurability.
169 Fixed bug with -s switch which prevents catdoc from returning
170 non-zero code when invoked on UNIX text file
173 Files now opened in binary mode thus allowing catdoc to work on
174 DOS and simular systems. All specs arrays now have terminating
178 Fixed missing terminating NUL in specs array, which caused
179 random seqfaults on Linux and many other systems, becouse
180 _specs_ is searched by _strchr_ fynction
183 First mayor public release, uploaded to CTAN. Tk interface
184 appeared, manual page was written. Unfortunately, this release