word2x developer's corner
If you hack code in your sleep, think spending lots of time
using GDB is good fun, and have a machine a tad higher spec than
mine :-), then word2x development can absorb all your spare time
and more. (An aging 486DX2/50 does not help when doing a quick recompile
before it dumps core again.)
Links
How do I get the latest development code?
You can get the latest code by anonymous ftp (and http and CVS)
from the word2x project on SourceForge. Alternatively, there is a local
copy on this machine. Be aware that this code will not give you a working
executable, will be extremely unstable if you do make it compile, etc,
etc. Developers are expected to help me fix these problems, add
cool new features, etc. This archive is not designed or recommended
for end users.
The current bug list is:
- Removing the cerr<< stuff in reader/reader.cc seems to
cause core dumps in the dequeue operation when
reader/mutliplex.cc calls the read() method. Any brilliant ideas
why? (glibc 2.0.6, libg++ 2.8.1, binutils 2.8.1 and g++ 2.8.1 on
my system).
- reader/eqnarr.cc and reader/sections.cc need finishing.
- The LaTeX and HTML output code needs updating
- Lots of Makefiles need updating
Please note this list may be out of date due to other things
getting in the way and developers fixing the problems :-)
New features
The next release is completely redesigned due to the
completely dire chaos that was affecting the previous
release. Word97's Unicode documents were raising their menacing heads,
leading to the need for more support. Thus some internal
infrastructure and a major code cleanup were required. Already done
are:
- Junk stripping code that understands Unicode documents.
- Modular reader stages that can be combined flexibly.
- Modular command line option processing.
- The ability to tune all the tunable stuff from the command line.
- Ability to specify custom processing on the command line.
- A module that prints the passing tokens.
- A module that queues tokens and forwards them in large
batches (useful in conjunction with the module above).
- Better table processing. However it could still be improved.
- Some programming documentation.
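
As a rough illustration of how the modular stages listed above could fit
together, here is a minimal sketch in modern C++ (C++11, not the g++ 2.8.1
dialect of the real tree). Every name in it (Token, Stage, PrintStage,
BatchStage, CollectStage, run_demo) is hypothetical and is not taken from the
word2x sources; it only shows the idea of a stage that prints passing tokens
and a stage that queues tokens and forwards them in batches.

```cpp
#include <cassert>
#include <cstddef>
#include <deque>
#include <ostream>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical token type; the real word2x token structure is not shown here.
struct Token {
    std::string text;
};

// Hypothetical stage interface: a stage receives tokens and may pass
// them on to the next stage in the chain.
class Stage {
public:
    explicit Stage(Stage* next = nullptr) : next_(next) {}
    virtual ~Stage() = default;
    virtual void put(const Token& t) = 0;
    // Push out anything still held, then ask the next stage to do the same.
    virtual void flush() { if (next_) next_->flush(); }
protected:
    void forward(const Token& t) { if (next_) next_->put(t); }
private:
    Stage* next_;
};

// Prints each passing token, then forwards it unchanged.
class PrintStage : public Stage {
public:
    PrintStage(std::ostream& out, Stage* next) : Stage(next), out_(out) {}
    void put(const Token& t) override {
        out_ << "token: " << t.text << '\n';
        forward(t);
    }
private:
    std::ostream& out_;
};

// Queues tokens and forwards them once batch_size tokens have piled up.
class BatchStage : public Stage {
public:
    BatchStage(std::size_t batch_size, Stage* next)
        : Stage(next), batch_size_(batch_size) {
        assert(batch_size_ > 0);
    }
    void put(const Token& t) override {
        queue_.push_back(t);
        if (queue_.size() >= batch_size_) drain();
    }
    void flush() override {
        drain();
        Stage::flush();
    }
private:
    void drain() {
        while (!queue_.empty()) {
            forward(queue_.front());
            queue_.pop_front();
        }
    }
    std::deque<Token> queue_;
    std::size_t batch_size_;
};

// End of the chain: remembers the text of every token it receives.
class CollectStage : public Stage {
public:
    void put(const Token& t) override { seen.push_back(t.text); }
    std::vector<std::string> seen;
};

// Feeds words through batch -> print -> collect and returns what was printed.
std::string run_demo(const std::vector<std::string>& words, std::size_t batch) {
    std::ostringstream trace;
    CollectStage sink;
    PrintStage printer(trace, &sink);
    BatchStage batcher(batch, &printer);
    for (const auto& w : words) batcher.put(Token{w});
    batcher.flush();
    return trace.str();
}
```

Putting the batching stage in front of the printing stage means the printed
trace arrives in bursts rather than token by token, which is one plausible
reading of why the two modules are useful together.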
As time progresses word2x increasingly simulates the resource
hog nature of Word. I think the overall problem is AI-complete
:-)
Further jobs
If you feel unable to do any of the jobs above, here is the
rest of the endless list of jobs you might like to try. It is not
exhaustive or even close.
- libole needs upgrading to a usable state
- Some readers that understand the binary junk would be nice for
accurate conversion.
- Backend functionality that would benefit all backends should
be changed into reader stage(s). It is not cheating to capture
something in more than one stage (the new table code is 2, or
arguably 3).
- wmf to eps, bmp to pnm, and MS equation to *TeX code. Finish
libole before starting these.
- *roff fancy ascii text output.
- Maybe *roff source output.
- Support for using getopt instead of getopt_long, at a cost to
functionality (partial).
- Fix all the bugs.
- Update the makefiles.
- Foolproof operation.
- Write an RTF understanding input stage.
- Applixware input and output for those that use that package.
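
For the getopt versus getopt_long job above, one possible approach is a
compile-time switch, sketched below. This is only an illustration under stated
assumptions: the HAVE_GETOPT_LONG macro, the Options struct, the functions
parse_options and parse_words, and the -v/-o options are all made up for the
example and are not the real word2x option set. Without getopt_long only the
short forms are accepted, which is the "cost to functionality".

```cpp
#include <cassert>
#include <string>
#include <vector>
#include <unistd.h>      // POSIX getopt(), optarg, optind, opterr
#ifdef HAVE_GETOPT_LONG  // hypothetical macro, assumed to come from the build
#include <getopt.h>      // GNU getopt_long()
#endif

// Hypothetical option set, for illustration only.
struct Options {
    bool verbose = false;
    std::string output;
    bool ok = true;      // false if an unknown option was seen
};

Options parse_options(int argc, char* const argv[]) {
    assert(argv != nullptr);
    Options opts;
    optind = 1;  // restart scanning so repeated calls work (glibc behaviour)
    opterr = 0;  // report errors through opts.ok instead of stderr
#ifdef HAVE_GETOPT_LONG
    static const struct option long_opts[] = {
        {"verbose", no_argument, nullptr, 'v'},
        {"output", required_argument, nullptr, 'o'},
        {nullptr, 0, nullptr, 0}
    };
#endif
    for (;;) {
#ifdef HAVE_GETOPT_LONG
        int c = getopt_long(argc, argv, "vo:", long_opts, nullptr);
#else
        int c = getopt(argc, argv, "vo:");  // short options only
#endif
        if (c == -1) break;
        switch (c) {
        case 'v': opts.verbose = true; break;
        case 'o': opts.output = optarg; break;
        default:  opts.ok = false; break;   // '?' for unknown options
        }
    }
    return opts;
}

// Convenience wrapper for trying the parser without a real command line.
Options parse_words(std::vector<std::string> words) {
    std::vector<char*> argv;
    for (auto& w : words) argv.push_back(&w[0]);
    argv.push_back(nullptr);
    return parse_options(static_cast<int>(words.size()), argv.data());
}
```

Keeping both calls behind one function means the rest of the option handling
does not need to know which of the two parsers was compiled in.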
Duncan Simpson
Last modified: Thu May 11 20:37:04 Europe/London 2000