SiteExperts.com Logo Home | Community | Developer's Paradise | Jobs
User Groups | Site Tools | Site Information | Search

Inside Technique : More Legacy Data and the Web: More Steps to a Successful Marriage
By P.C. McGrew, EDPP and W.D. McDaniel, EDPP

Legacy Data and the Web Series
Part 1: Steps to a Successful Marriage
Part 2: More Steps to a Successful Marriage
Part 3: The Adventure Continues
Part 4: More fun with objects
Part 5: Pulling it all together!
Part 6: Getting it to the Web!
Part 7: Getting Graphics to the Web!
Part 8: Getting Fonts to the Web!
Part 9: WAPping it to the Web!
Part 10: Finding Font Solutions in Your Legacy-to-Web Adventure!
Part 11: Do your documents want to go to the web?
Part 12: Evaluating when documents need reformatting.



When last we gathered to discuss the solemn union of the web and your legacy data we looked at the types of applications that might blend into the web easily.  We looked at some files formats and laid a plan for embarking on the project by making lots of lists and then checking them over and over to ensure that everyone involved had a clear picture of how much stuff really moves through your print operation. And not just from the printing point-of-view, but from the set-up point of view as well.  There is some process that makes it possible to merge all of the data into the forms and formats that turn data into bills, invoices, policies, statements, and all of the other business print we deal in. That's the process we have to understand. 

The lists still need a bit of work, though.  Knowing what you print and how it gets to the printer is only part of the story. We also need to know a bit about the components of the things you print, such as  fonts, graphics, external resources like overlays or electronic forms, and the real tricky component -- pre-printed forms. For each of your target applications you are going to need to know the real story on each of these building blocks, and it may not be as easy as it looks!

Let's start with the obvious, the fonts that you see on the paper or on the screen, regardless of where the data originates. The fonts are what the people using the documents see, and most really don't care where they care from or how you got the them into the file. They care that they can read them. Those that do care tend to be the guardians of the corporate logo standards, and they may pose a small challenge.   The application developers who added in fonts containing small graphic objects, signatures, and a few other font pieces may have a couple of words for you as well. And let's not forget environments where there are printers from a variety of vendors, most using their own unique fonts. 

Before this gets overwhelming, let's start with with the basics in most corporate print shops. We're talking about the MIS/IT type of printing here, for the most part, and not the type of printing that goes to the big web presses. Even though these areas are converging, we'll save that for a later column. Our baseline are the Xerox, IBM, Oce', and similar printers that populate most corporate print environments. In reality, your company may own one or more of each vendor's offering as a hedge against being able to switch easily between vendors. Or, you may have acquired the "other vendor's" hardware in a merger/acquisition. We see this all of the time. Let's not worry too much about the variety, let's just get comfortable with everything that you have. 

The primary print file formats you encounter in big corporate environments are Plain Text formats (sometimes with a few twists), the IBM AFP (Advanced Function Printing) Format and the Xerox formats called DJDE (Dynamic Job Descriptor Entry) and Metacode. As we noted above, it isn't unusual for a company to have print files in all of these formats, or to use a third part product to turn one file format into another. The important part to remember for the moment is that all of these types of files use fonts that are in a specific format.  You cannot use the nice Book Antigua from your PC and use it in a corporate report destined for print on the company IBM 3900 AFP printer. The printer and its software will not understand the information in the file.

IBM fonts have variations of their own and you will need to track down which ones you have and use to get ready for a migration to the web. The oldest style of IBM font is the type that supported their oldest line print devices (many of which are still in use).  You'll find these fonts used in files we often refer to as line data the looks remarkably like what you might have produced in a typewriter.  Line data in this format has a few more bells as whistles, which we will discuss in another column. The important part is that it calls fonts, usually by designating their position on a pre-existing font list. Whether it is font three or font four isn't nearly as important to us as what fonts are on that font list, and what format the font files are in.  By format we mean the character set, the resolution, and information about how the intercharacter and interword spacing is handled. In these old type of fonts you will often see names like Elite and Prestige, reminiscent of the old typewriter balls from the early IBM electric typewriters and word processing systems. Note the name of any font you see identified because we are going to need to try to match it to a web font down the road. 

In addition to the oldest of the IBM fonts there are the middle aged IBM fonts that arrived iwth the dawn of AFP printing in the middle 1980's. There are also a set of fonts IBM distributed called the PSF Compatibility Fonts which were designed to allow the users of their newest printers to continue printing the jobs they had already created with little or no change to the code that was in use.  You'll know these fonts by looking for names like GT-10, GT-12, Prestige, OCR-A. These fonts were built with the specific requirements of older data in mind, so they do pose some challenges as we try to migrate to the web.  They next generation, the IBM Sonoran family of Typographic fonts were intended to provide a complete font family that the document designer could use reliably to get great looking documents from their high speed printer.  IBM faced a small challenge in that their printers were running at 240 dots per inch squared while most of their competitors were running at 300 dots per inch squared or more. That meant that IBM had to shave pixels out of their font characters yet still appear to achieve perfect, art-director approvable print. And they did it! But those Sonoran fonts don't have a web-matched version either. While they are based on the standard Arial and New Times Roman, there are real differences which might cause a few formatting problems during migration. 

The latest generation of IBM fonts moved on to Adobe's technology so you will generally find that documents created using fonts with the familiar names like Arial, New Century School Book, and Times New Roman are based on the same masters as those fonts you are familiar with on your desktop.  That makes our job a lot easier. Let stop in the IBM realm with the requirement to make a list of every font you think might be used in a print file.  Talk to your System administrators and find out what fonts are in the test and production font libraries, which will be the best place to start.  If the fonts are in those libraries they cannot end up in one of your documents. 

Now let's look over at Xerox.  Here things look quite a bit different because the folks at Xerox settled on a standard font tape early in their adventure in laser printing.  If you were a North American client you became the proud owner of the A03 tape, and in Europe you received the R03 tape.  These tapes contained font files in standard Xerox font file format for fonts that Xerox thought that you would use to build your forms and documents. They also tried to emulate fonts that they expected you might have used in your line printer data, and tried to provide a sampling of typographic fonts. 

Where the IBM fonts live in a library on that big mainframe that hums along in MIS, the Xerox folks took a different approach and the fonts because part of the system files on each individual machine. The logic was that while IBM printers were true system printers and would always be attached to the mainframe computer, the Xerox printers could be set up to run in what was called offline mode in printer farms. This meant that as long as you kept all of the fonts you owned, including those you created or purchased, on every machine, you could print any job at any time on any machine.  This was a huge selling point, but it is also the place where your life becomes interesting if you have Xerox printers to deal with. 

Let's start with the fact that Xerox was making several types of printers at the time, and they did not share file formats or font formats. These big, high-speed machines sometimes came with a free XES machine, which Xerox would should you how to use for office printing, and sometimes job proofing.  But that's another set of fonts again!  And sometimes jobs began on those printers and were never migrated to the big printers. I think you get the picture. 

So, in the big world of IBM and Xerox fonts, here are the kinds of things you need to be looking for and writing down. 

IBM

Fonts   

Issues   

Web matches 

IBM3800   

Some are at 120 dots per inch, most are designed for  system printing and don't look very good when ported to the web.  

 None

GraphMod   

These fonts were created by systems programmers, usually as logos and signatures for use in check and policy applications.  It is very difficult to convert these to web fonts. The best bet is to alter the program to use a graphic format instead. 

 None

IBM3820   

This is the next generation at 240 dots per inch, mostly based on Adobe original masters and then hand tuned to look good on the IBM 240 dot per inch printers. The Arial and Times New Roman often look good, and some vendors of PC-based fonts have tried their hand at creating usable looking fonts.  The thing to check is always the inter-word and inter-character spacing, especially in applications with rows and columns of numbers.  If the Pi and Specials font is used to access data processing characters you will have some challenges. 

 Arial and Times New Romans

PSF Compatibility Fonts 

These were created by IBM to make migration from their line printers to their high-speed laser printers easier. These fonts have names like GT10, GT12, Prestige, and OCR-A, which match most of those fonts you saw in older line printer applications. The problem is that most of these fonts are fixed-pitch fonts -- each character is the same width.  There is no such thing on the web where all of the fonts expand and contract a bit.  Check any applications relying on rows and columns of numbers to line up very carefully.

 None

Newest Fonts

The new generation of IBM fonts are being built with the assumption that they will be needed for printing and viewing applications. Check your list of fonts carefully and see how many appear to be PC-based fonts.  Do not, however, assume that all of the characters will be included in the the character set.  Remember to check every page that you convert to ensure that currency symbols and other special characters appear in the output.

 None

Unmentionables

For the sake of covering all of the bases, people put all sorts of things into font files. Signatures, flow charts, pie charts, corporate logos, and just about anything else. Remember that any of these special fonts will need some web equivalent, it it may not be easy to get there!

 None

Xerox

Fonts   

Issues   

Web matches 

.FNT   

Xerox fonts live in the world at 300 dots per inch, generally. That is the printer resolution. The printer came with software to permit users to add characters to fonts, combine fonts, and delete standard fonts from the machines, all of which you can find in any given shop. This means that for every Xerox printer, high-speed or low speed, you'll need a list of fonts that are stored on the machine.  IF you can find out what applications are using them, that is great as well.  For many Xerox fonts, especially those purchased from Xerox or one of their third party vendors as a complete font family, locating web matches is very easy.  Most turn out to be standard Adobe fonts.  For anything that was done as a customer font, though, you'l want to be very careful checking the test output of the jobs using the fonts.   

 None

.LGO   

Xerox is unique in that if pre-configured a font format that they called .LGO to handle logos and signatures as fonts on their machines.  That means that where ever you find LGO files you'll be looking for some way to find a web equivalent. 

 None

 

That's enough to overwhelm anyone. Let's take a break here and next month we'll look at graphic formats.  Remember that the goal is to be able to create lists of exactly what you have so that when it comes time to migrate your documents to the web you'll be able to ensure that you have the same types of resources waiting on the web-side of the world. As always, we'll take questions, too! Address them to siteexpert@mcgrewmcdaniel.com.

Discuss and Rate this Article