Package com.aowagie.text.pdf
Class CMapAwareDocumentFont
java.lang.Object
com.aowagie.text.pdf.BaseFont
com.aowagie.text.pdf.DocumentFont
com.aowagie.text.pdf.CMapAwareDocumentFont
Implementation of DocumentFont used while parsing PDF streams.
- Since:
- 2.1.4
-
Nested Class Summary
Nested classes/interfaces inherited from class com.aowagie.text.pdf.BaseFont
BaseFont.StreamFont
-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate char[]
Mapping between CID code (single byte only for now) and unicode equivalent as derived by the font's encoding.private final PdfDictionary
The font dictionary.private int
the width of a space for this font, in normalized 1000 point unitsprivate CMap
The CMap constructed from the ToUnicode map from the font's dictionary, if present.Fields inherited from class com.aowagie.text.pdf.BaseFont
ASCENT, AWT_ASCENT, AWT_DESCENT, AWT_LEADING, AWT_MAXADVANCE, BBOXLLX, BBOXLLY, BBOXURX, BBOXURY, BuiltinFonts14, CACHED, CAPHEIGHT, CHAR_RANGE_ARABIC, CHAR_RANGE_CYRILLIC, CHAR_RANGE_HEBREW, CHAR_RANGE_LATIN, charBBoxes, CID_NEWLINE, compressionLevel, COURIER, COURIER_BOLD, COURIER_BOLDOBLIQUE, COURIER_OBLIQUE, CP1250, CP1252, CP1257, DESCENT, differences, directTextToByte, embedded, EMBEDDED, encoding, fastWinansi, FONT_TYPE_CJK, FONT_TYPE_DOCUMENT, FONT_TYPE_T1, FONT_TYPE_T3, FONT_TYPE_TT, FONT_TYPE_TTUNI, fontCache, fontSpecific, fontType, forceWidthsOutput, HELVETICA, HELVETICA_BOLD, HELVETICA_BOLDOBLIQUE, HELVETICA_OBLIQUE, IDENTITY_H, IDENTITY_V, ITALICANGLE, MACROMAN, NOT_CACHED, NOT_EMBEDDED, notdef, RESOURCE_PATH, specialMap, STRIKETHROUGH_POSITION, STRIKETHROUGH_THICKNESS, SUBSCRIPT_OFFSET, SUBSCRIPT_SIZE, subset, subsetRanges, SUPERSCRIPT_OFFSET, SUPERSCRIPT_SIZE, SYMBOL, TIMES_BOLD, TIMES_BOLDITALIC, TIMES_ITALIC, TIMES_ROMAN, UNDERLINE_POSITION, UNDERLINE_THICKNESS, unicodeDifferences, widths, WINANSI, ZAPFDINGBATS
-
Constructor Summary
ConstructorsConstructorDescriptionCMapAwareDocumentFont
(PRIndirectReference refFont) Creates an instance of a CMapAwareFont based on an indirect reference to a font. -
Method Summary
Modifier and TypeMethodDescriptionprivate int
For all widths of all glyphs, compute the average width in normalized 1000 point units.decode
(byte[] cidbytes, int offset, int len) Decodes a string of bytes (encoded in the font's encoding) into a unicode string This will use the ToUnicode map of the font, if available, otherwise it uses the font's encodingprivate String
decodeSingleCID
(byte[] bytes, int offset, int len) Decodes a single CID (represented by one or two bytes) to a unicode String.int
getWidth
(int char1) Gets the width of achar
in normalized 1000 units.private void
Parses the ToUnicode entry, if present, and constructs a CMap for itprivate void
Inverts DocumentFont's uni2byte mapping to obtain a cid-to-unicode mapping based on the font's encodingMethods inherited from class com.aowagie.text.pdf.DocumentFont
charExists, convertToBytes, convertToBytes, getAllNameEntries, getCharBBox, getFamilyFontName, getFontDescriptor, getFullFontName, getFullFontStream, getIndirectReference, getKerning, getPostscriptFontName, getRawCharBBox, getRawWidth, getUni2Byte, getWidth, hasKernPairs, setKerning, setPostscriptFontName, writeFont
Methods inherited from class com.aowagie.text.pdf.BaseFont
addSubsetRange, correctArabicAdvance, createEncoding, createFont, createFont, createFont, createFont, createFont, createFont, createFont, createSubsetPrefix, enumerateTTCNames, enumerateTTCNames, getAllFontNames, getAllNameEntries, getAscent, getAscentPoint, getBaseName, getCidCode, getCodePagesSupported, getCompressionLevel, getDescent, getDescentPoint, getDifferences, getDocumentFonts, getDocumentFonts, getEncoding, getFontType, getFullFontName, getResourceStream, getResourceStream, getUnicodeDifferences, getUnicodeDifferences, getUnicodeEquivalent, getWidthPoint, getWidthPoint, getWidthPointKerned, getWidths, isDirectTextToByte, isEmbedded, isFontSpecific, isForceWidthsOutput, isSubset, normalizeEncoding, setCharAdvance, setCompressionLevel, setDirectTextToByte, setForceWidthsOutput, setSubset
-
Field Details
-
fontDic
The font dictionary. -
spaceWidth
private int spaceWidththe width of a space for this font, in normalized 1000 point units -
toUnicodeCmap
The CMap constructed from the ToUnicode map from the font's dictionary, if present. This CMap transforms CID values into unicode equivalent -
cidbyte2uni
private char[] cidbyte2uniMapping between CID code (single byte only for now) and unicode equivalent as derived by the font's encoding. Only needed if the ToUnicode CMap is not provided.
-
-
Constructor Details
-
CMapAwareDocumentFont
Creates an instance of a CMapAwareFont based on an indirect reference to a font.- Parameters:
refFont
- the indirect reference to a font
-
-
Method Details
-
processToUnicode
private void processToUnicode()Parses the ToUnicode entry, if present, and constructs a CMap for it- Since:
- 2.1.7
-
processUni2Byte
private void processUni2Byte()Inverts DocumentFont's uni2byte mapping to obtain a cid-to-unicode mapping based on the font's encoding- Since:
- 2.1.7
-
computeAverageWidth
private int computeAverageWidth()For all widths of all glyphs, compute the average width in normalized 1000 point units. This is used to give some meaningful width in cases where we need an average font width (such as if the width of a space isn't specified by a given font)- Returns:
- the average width of all non-zero width glyphs in the font
-
getWidth
public int getWidth(int char1) Description copied from class:DocumentFont
Gets the width of achar
in normalized 1000 units.- Overrides:
getWidth
in classDocumentFont
- Parameters:
char1
- the unicodechar
to get the width of- Returns:
- the width in normalized 1000 units
- Since:
- 2.1.5 Override to allow special handling for fonts that don't specify width of space character
- See Also:
-
decodeSingleCID
Decodes a single CID (represented by one or two bytes) to a unicode String.- Parameters:
bytes
- the bytes making up the character code to convertoffset
- an offsetlen
- a length- Returns:
- a String containing the encoded form of the input bytes using the font's encoding.
-
decode
Decodes a string of bytes (encoded in the font's encoding) into a unicode string This will use the ToUnicode map of the font, if available, otherwise it uses the font's encoding- Parameters:
cidbytes
- the bytes that need to be decodedoffset
- index of offsetlen
- index fot the lenght- Returns:
- the unicode String that results from decoding
- Since:
- 2.1.7
-