Split large text/html files into smaller files. I find it
much faster and more accurate than cut and paste.
You embed commands in the big file telling it which pieces
of it are to go where, then let Splitter do the work. It is
much faster and more accurate than trying to select huge
blocks of text in an editor. You don't accidentally lose or
duplicate text. Keeping files small makes the site more
responsive.
You embed multiple
...
stuff that will end up in the charlie.html file.
...
The text between the and tags is split
off into that named file and the text is removed from the
original file along with the tags. The tags are
case-insensitive, so you can use upper or lower case.
1. Filenames may be absolute or relative, with no quotes or spaces.
2. Tags may be nested, but they must balance (equal number
of and ).
3. Tags are case-insensitive, i.e. may be lower or upper case.
4. Multiple tags may be directed to the same
file, where they will be appended.
5. If the files mentioned in the split tags already exist,
they will be overwritten. If you don't like this, split to temporary
files and then merge them into the existing files as you see fit.
(At some point I will invent a switch to control this behaviour).
6. Anything not inside ... is retained in
the original file. Everything else is removed.
7. To discard text, use ... then
when you are done discard the generated temp.txt file.
8. text not inside ... will be left as is
in the orginal file. All else will removed.
The file being split must be small enough to fit in RAM.
Java array addressing limits the file to 2GB, though other
considerations mean in practice the largest file you can
handle will be smaller still.
To install, Extract the zip download with WinZip, available from
http://www.winzip.com (or similar unzip utility) into any
directory you please, often C:\ -- ticking off the "user
folder names" option. To run as an application, type:
java.exe -jar E:\com\mindprod\splitter\splitter.jar x.html
adjusting as necessary to account for where the jar file is.
where x.html is the file to be split containing the embedded
split commands.
You may process files not encoded with the default encoding by adding
the encoding to the end of the command line, e.g.
java.exe -jar E:\com\mindprod\splitter\splitter.jar -charset=UTF-8 x.html
You may also put multiple files on the command line, also dirs. The -s
switch causes all subsequent subdirs to be processed too. However,
wildcards are not yet supported.
For details on possible encodings and how to tell which
encoding a file is using, see
http://mindprod.com/jgloss/encoding.html
Why the axe icon? It symbolises splitting a file up into
smaller pieces.