• Please review our updated Terms and Rules here

Print set naming question

Bitly

Experienced Member
Joined
Sep 1, 2016
Messages
255
Location
Westminster, Colorado
I've finally (re)started my scanning project, I've got about 3' of schematics to go through, but I ran into an issue with the first one. I've got five copies of the DZ11 print set, three unique ones.

Two have identical first pages (Rev B, 1978) but different third pages (one is Rev F/1978 and the other is Rev J/1979). I've only included a couple pages, but there are more differences later on. the 1979 date is from page 1 of the schematics.

The general bitsavers naming convention is: MPxxxxx-Title-Rev-Date.pdf where the revision and date are from the first page.

So, what would the community suggest for an updated naming convention? I've got three options, but I don't really like any of them.

This seems cluttered, and might be unworkable for multi-board print sets:
MP00132-DZ11-Rev_BF-1978 & MP00132-DZ11-Rev_BJ-1979

I don't like dropping the revision letter altogether:
MP00132-DZ11-1978 & MP00132-DZ11-1979

This looks too similar to bitsavers and might cause confusion:
MP00132-DZ11-Rev_F-1978 & MP00132-DZ11-Rev_J-1979

CW
View attachment Page0001.png
View attachment Page0003.png
View attachment Page0003.png
 
This seems cluttered, and might be unworkable for multi-board print sets:
MP00132-DZ11-Rev_BF-1978 & MP00132-DZ11-Rev_BJ-1979
IMO when it comes to encoding metadata in filenames "clutter" is not an appropriate criterion.
Reduction in "clutter" necessarily reduces the amount, and utility, of the metadata encoded.
I would always err on the side of "clutter" with the strategic use of visual separators to result in more-or-less human readable results.
Here IMO it's OK to not add yet another separator, as *that* would would be "clutter", but it's more a point of style than correctness!

IMO when handling multi-board/version print sets either:

1. Separate them into per-module/part for maximal search, and use, utility.
2. Label them by the published cover/consolidated title.
3. Both!

In the present case #2: MP00132-DZ11-A,B,E-RevB-1976 (Order#-Title-Revision-Date; "FMPS" from the formal title is implied by the Order#)

#3 at your discretion :->.
 
Maybe scan each version and create an explanatory text file and create a zip file of "DZ-11 Schematics 1978-79 - 3 versions" ??

Great work in any case...

Robin
 
Tangential question: scanning from paper or microfiche?
Paper right now, I don't think there is anything not on bitsavers but I'm targeting higher resolution and more (manual) cleanup. I'm still figuring out how to build a microfiche scanner (using a 3d printer and raspberry pi) or automating a MIC-9.

CW
 
Paper right now, I don't think there is anything not on bitsavers but I'm targeting higher resolution and more (manual) cleanup. I'm still figuring out how to build a microfiche scanner (using a 3d printer and raspberry pi) or automating a MIC-9.

CW
Schematics are going to need a lot of manual cleanup because the OCR process doesn't work well with signal names. DEC also use different signal names for the two states of the same signal. That makes it very challenging to check how well the OCR worked.
 
Mostly I'm cropping, deskewing, and despeckling the pages before converting to B&W. Too aggressive on the despeckle and you remove decimal points and other useful data, but not aggressive enough results in a lot of manual cleanup.

OCR is generally crap since the fonts DEC used (and handwriting) are not well detected. I'm looking for a non-adobe PDF editor, or an older OCR that is less optimistic. Tesseract looks at a line of periods and decides it's a string of 'c's and 'e's in a really tiny font.

CW
 
I've never seen it documented before, but I figured out:

TC - Table of Contents, typically the first page of a print set
DD - Drawing Directory, which seems to list out all the drawings for a single unit
PL - Parts List
UA - Unit Assembly, component locations, board layouts, and drill guide (sometimes)
BD - Block Diagram
CS - Circuit Schematic

The page number doesn't seem to be used in the print sets I've seen. The pages usually end with '-0-1'

CW
 
Back
Top