RFC0183: EBCDIC Codes and Their Mapping to ASCII

Network Working Group                                        J.M. Winett
Request for Comments: 183                             Lincoln Laboratory
NIC: 7127                                                  July 21, 1971

Categories: D.2, D.3
Related: 109, 110, 105, 158


              The EBCDIC Codes and Their Mapping to ASCII

Abstract

   To uniquely map the ASCII codes into corresponding EBCDIC codes in a
   consistent manner throughout the ARPA Network, this RFC describes and
   defines the IBM Standard Extended BCD Interchange Code.

Introduction

   The IBM Corporate Systems Standard, Extended BCD Interchange Code
   (EBCDIC) defines 8-bit graphic and control codes (see Figure 1).  The
   basic EBCDIC code consists of 54 controls (including space) and 88
   graphics.  This set is extended to include 10 special graphics and 1
   special control (EO).  These special graphics originate from the
   7-bit Hollerith code and include 6 ASCII graphics.  The EBCDIC code is
   further extended to include the publishing and printing graphics
   option, which specifies 52 graphics.  Of these graphics, 32 appear on
   the IBM TN print chain.  Four of these graphics are duals with
   graphics not on the TN print chain, and one graphic (degree) is dual
   with a graphic in the special graphics set of the basic code (tilde).

   It is desirable to uniquely map the ASCII codes into corresponding
   EBCDIC codes in a consistent manner throughout the ARPA network.

   For each of the 34 ASCII controls (including space and delete) there
   is a corresponding EBCDIC control (assigning ASCII control DC3 to the
   EBCDIC code X'13').  For 85 of the 94 ASCII graphics, there is a
   corresponding graphic in the basic EBCDIC set.  Three different
   correspondences can be made for the other 9 ASCII graphics.
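
   The counts quoted above can be checked against the 7-bit ASCII code
   range.  The following is a minimal Python sketch and not part of the
   proposal itself; all names are illustrative, and the list of nine
   graphics is assumed from the characters named in Sections I through
   III.

```python
# Illustrative check of the ASCII counts quoted in the text.
# All names here are hypothetical and chosen only for this sketch.

# 32 control codes, plus space and delete, which the text counts as controls.
ASCII_CONTROLS = list(range(0x00, 0x20)) + [0x20, 0x7F]

# The printing characters between space and delete.
ASCII_GRAPHICS = list(range(0x21, 0x7F))

# The nine ASCII graphics with no counterpart in the basic EBCDIC set,
# assumed from the characters named in Sections I-III.
SPECIAL_ASCII_GRAPHICS = "[\\]^`{|}~"

counts = {
    "controls": len(ASCII_CONTROLS),
    "graphics": len(ASCII_GRAPHICS),
    "special": len(SPECIAL_ASCII_GRAPHICS),
    "in_basic_ebcdic": len(ASCII_GRAPHICS) - len(SPECIAL_ASCII_GRAPHICS),
}
```

   Under these assumptions the dictionary holds 34 controls, 94
   graphics, 9 special graphics, and 85 graphics with a basic EBCDIC
   counterpart, which agrees with the figures in the text.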













Winett                                                          [Page 1]

RFC 183         EBCDIC Codes and Their Mapping to ASCII    July 21, 1971


I.  IBM Correspondence

   a) IBM recommends the following ASCII duals with the basic EBCDIC
      graphics.

           ASCII                   EBCDIC                  Code
           [                       [cent sign]             X'4A'
           ]                       !                       X'5A'
           !                       |                       X'4F'
           [circumflex]            [upper right corner]    X'5F'

   Note that the EBCDIC graphic for exclamation point (!) is not chosen
   to correspond to the ASCII for exclamation point (!), though this
   would be a sensible choice, and thus another code must be used to
   represent this graphic.

   b) Special EBCDIC graphics would be used to represent the other ASCII
      graphics.

           Graphic                         Code
           |                               X'6A'
           `                               X'79'
           ~                               X'A1'
           \                               X'E0'
           {                               X'C0'
           }                               X'D0'
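
   The IBM correspondence can be written down as a lookup table.  The
   following Python sketch is illustrative only: the dictionary and
   function names are assumptions, and the codes for the special
   graphics in (b) are taken from Figure 5.

```python
# Sketch of the IBM correspondence (Sections I.a and I.b).
# Names are illustrative; codes for part (b) follow Figure 5.
IBM_CORRESPONDENCE = {
    # (a) ASCII graphics paired with basic EBCDIC graphics
    "[": 0x4A,   # EBCDIC cent sign
    "]": 0x5A,   # EBCDIC exclamation point
    "!": 0x4F,   # EBCDIC vertical bar (logical or)
    "^": 0x5F,   # EBCDIC logical not
    # (b) ASCII graphics represented by special EBCDIC graphics
    "|": 0x6A,   # vertical line
    "`": 0x79,   # grave accent
    "~": 0xA1,   # tilde
    "\\": 0xE0,  # reverse slant
    "{": 0xC0,   # opening brace
    "}": 0xD0,   # closing brace
}


def ascii_to_ebcdic_ibm(ch: str) -> int:
    """Return the EBCDIC code for one of the listed ASCII graphics."""
    return IBM_CORRESPONDENCE[ch]


# The reverse table is only well defined if no EBCDIC code is reused.
EBCDIC_TO_ASCII_IBM = {code: ch for ch, code in IBM_CORRESPONDENCE.items()}
```

   Note that the table has ten entries rather than nine, since the
   ASCII exclamation point is displaced to X'4F' by the choice made
   for the close bracket.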

II.  Publishing Correspondence

   a) Associate the following special EBCDIC graphics with the
      corresponding ASCII graphics.

           Graphic                         Code
           [circumflex]                    X'71'
           [                               X'AD'
           ]                               X'BD'
           {                               X'8B'
           }                               X'9B'

   The codes for open bracket and close bracket are chosen since these
   graphics appear on the TN print chain.  The codes for left brace and
   right brace are chosen rather than the codes in the special graphics
   set for opening brace and closing brace, respectively, since these
   graphics are similar and also appear on the TN print chain.
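
   The five publishing assignments above can be sketched as a small
   table.  This Python fragment is an illustration, not a definitive
   implementation; the names and the helper that prints codes in the
   X'..' notation are assumptions made for the example.

```python
# Sketch of the Publishing correspondence (Section II.a).
# Only the five graphics listed in the text are covered.
PUBLISHING_CORRESPONDENCE = {
    "^": 0x71,  # circumflex
    "[": 0xAD,  # open square bracket (on the TN print chain)
    "]": 0xBD,  # close square bracket (on the TN print chain)
    "{": 0x8B,  # left brace (on the TN print chain)
    "}": 0x9B,  # right brace (on the TN print chain)
}


def hex_notation(code: int) -> str:
    """Format a code in the X'..' notation used in this RFC."""
    return f"X'{code:02X}'"


# The same table rendered in the document's notation.
PUBLISHING_TABLE = {
    ch: hex_notation(code) for ch, code in PUBLISHING_CORRESPONDENCE.items()
}
```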


III.  Graphical Correspondence

   a) Associate the following basic EBCDIC graphics with the indicated
      ASCII graphics because of their graphic similarity.

           |                       X'4F'           with |
                                                        |

           [upper right corner]    X'5F'           with ~

   b) Associate the basic EBCDIC graphic for cent with the ASCII graphic
      for reverse slash.

           [cent]          X'4A'           with \

   This choice is made since the cent graphic is not an ASCII graphic
   and is the only graphic in the basic EBCDIC set which would not
   otherwise be associated with any ASCII graphic.

   c) Associate the special EBCDIC graphic grave accent

         `  X'79'

      with the corresponding ASCII graphic.

   d) Associate the following publishing EBCDIC graphics with the
   corresponding ASCII graphics.

           [circumflex]    X'71'
           [               X'AD'
           ]               X'BD'
           {               X'8B'
           }               X'9B'

   The codes for open bracket and close bracket are chosen since these
   graphics appear on the TN print chain.  The codes for left brace and
   right brace are chosen rather than the codes in the special graphics
   set for opening brace and closing brace, respectively, since these
   graphics are similar and also appear on the TN print chain.
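
   Taken together, parts (a) through (d) assign a code to each of the
   nine special ASCII graphics.  The Python sketch below collects them
   and checks that the assignment is complete and free of collisions.
   The names are illustrative assumptions, not part of the proposal.

```python
# Sketch of the Graphical correspondence (Section III, parts a-d).
GRAPHICAL_CORRESPONDENCE = {
    "|": 0x4F,   # (a) basic EBCDIC vertical bar
    "~": 0x5F,   # (a) basic EBCDIC logical not, by graphic similarity
    "\\": 0x4A,  # (b) basic EBCDIC cent sign
    "`": 0x79,   # (c) special graphic: grave accent
    "^": 0x71,   # (d) publishing graphic: circumflex
    "[": 0xAD,   # (d) open square bracket
    "]": 0xBD,   # (d) close square bracket
    "{": 0x8B,   # (d) left brace
    "}": 0x9B,   # (d) right brace
}

# The nine ASCII graphics that lack a basic EBCDIC counterpart.
NINE_SPECIAL_GRAPHICS = set("[\\]^`{|}~")


def is_complete_and_unique(mapping: dict) -> bool:
    """True if all nine graphics are mapped and no code is used twice."""
    return (set(mapping) == NINE_SPECIAL_GRAPHICS
            and len(set(mapping.values())) == len(mapping))


def encode_specials(text: str) -> list:
    """Return the EBCDIC codes of the special graphics in text, in order."""
    return [GRAPHICAL_CORRESPONDENCE[ch]
            for ch in text if ch in GRAPHICAL_CORRESPONDENCE]
```

   For instance, encode_specials("a[0]") yields the codes for the two
   brackets only; the remaining characters would be handled by the
   basic EBCDIC table of Figure 1, which is not reproduced here.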

   Standards:

   In order that the mapping from ASCII into EBCDIC and vice versa could
   become standardized, I would appreciate comments on the above from
   each site whose operating system uses EBCDIC as the internal code.

   Telnet Codes:

   For those sites who may wish to provide or use TELNET services that
   communicate using an EBCDIC code, a standard code must be specified.
   The codes given in Figure 1 can form the basis for a standard.
   Specific codes must also be specified for the TELNET control codes.
   The following are suggested:

                                   Hex Code
           sync                    38
           break                   39
           NOP                     3A
           Return to ASCII         FF
           No echo                 14
           Echo                    23
           Hide input              24
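
   The suggested codes can be captured as a set of named constants.
   The Python sketch below is illustrative: the class and function
   names are assumptions, and treating every occurrence of one of
   these bytes as a control is a simplification not stated in the
   text.

```python
from enum import IntEnum


class EbcdicTelnetControl(IntEnum):
    """Suggested EBCDIC TELNET control codes (hypothetical class name)."""
    SYNC = 0x38
    BREAK = 0x39
    NOP = 0x3A
    RETURN_TO_ASCII = 0xFF
    NO_ECHO = 0x14
    ECHO = 0x23
    HIDE_INPUT = 0x24


def classify(byte: int):
    """Return the matching control, or None if the byte is ordinary data."""
    try:
        return EbcdicTelnetControl(byte)
    except ValueError:
        return None
```

   A receiver could call classify() on each incoming byte and pass
   through those for which it returns None.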

   To eliminate using one code for two graphics, I propose that the TN
   graphics be associated with their corresponding code.  The graphic
   tilde (~) might be assigned to the code X'E1' rather than keeping the
   dual with the graphic for degree.  This would have no effect if the
   Graphical Correspondence were chosen for the EBCDIC to ASCII mapping
   with the code X'5F' for logical not associated with tilde.  The other
   graphics of the publishing and printing option (Double Acute,
   Inferior Hook, Macron, and Inferior Comma) which are not on the TN
   print chain but have the same codes as graphics on the TN print chain
   would not be considered to be part of the standard EBCDIC code.
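
   The effect of this proposal on the dual codes of Figure 8 can be
   sketched as follows.  The Python names are illustrative, and X'E1'
   is only the tentative tilde code mentioned above, not a settled
   assignment.

```python
# Each dual code maps to (TN print chain graphic, graphic sharing the code),
# as listed in Figure 8.
GRAPHIC_DUALS = {
    0xA1: ("degree", "tilde"),
    0xB0: ("superscript zero", "double acute"),
    0xB1: ("superscript one", "inferior hook"),
    0xB2: ("superscript two", "macron"),
    0xB3: ("superscript three", "inferior comma"),
}

# Tentative value: the text says tilde "might be assigned" to this code.
PROPOSED_TILDE_CODE = 0xE1


def resolve_duals(duals: dict, tilde_code: int = PROPOSED_TILDE_CODE) -> dict:
    """Keep the TN graphic for each dual code and move tilde elsewhere."""
    table = {code: tn_graphic for code, (tn_graphic, _other) in duals.items()}
    table[tilde_code] = "tilde"
    return table


STANDARD_GRAPHICS = resolve_duals(GRAPHIC_DUALS)
```

   In the resulting table each code names exactly one graphic, and the
   four publishing graphics that are not on the TN print chain no
   longer appear.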

EBCDIC Questionnaire

   1. For ASCII to EBCDIC mapping of the 9 special ASCII graphics do you
      prefer:

      a)   The IBM correspondence             _____________
      b)   The Publishing correspondence      _____________
      c)   The Graphical correspondence       _____________
      d)   Another correspondence (describe)  _____________

   2. Do you concur with the definition of the standard EBCDIC code,
      including TELNET control codes?

           YES __________  NO __________

      Comments:
      __________________________________________________________________
      __________________________________________________________________
      __________________________________________________________________

   3. Please list for your operating system:

      a)   Graphics not included in the complete EBCDIC code.
      b)   Graphics given a different code.
      c)   Controls given one of the graphic codes.
      d)   Controls given one of the control codes but defined to be a
           different control.
      e)   All the controls which have meaning with your operating
           system (i.e., for which special action is taken) and state
           the action.

   Reply from:     Name                    _________________
                   Telephone               _________________
                   Site                    _________________
                   Host Computer           _________________

   Send to:        Joel M. Winett
                   M.I.T. Lincoln Laboratory
                   Room C-151
                   Lexington, Mass. 02173

   Or call:        (617) 862-5500 ext. 7474

   Figure 1.  [Please view the PDF version of this RFC.]

   Figure 2.  [Please view the PDF version of this RFC.]

   Hex Code      Category      Control      Name
   00            CC            NUL          Null
   01            CC            SOH          Start of Heading
   02            CC            STX          Start of Text
   03            CC            ETX          End of Text
   04            DC            PF           Punch Off
   05            FE            HT           Horizontal Tab
   06            GR            LC           Lower Case
   07            GR            DEL          Delete
   08            GR            GE           Graphic Escape
   09            FE            RLF          Reverse Line Feed
   0A            CC            SMM          Start of Manual Message
   0B            FE            VT           Vertical Tab
   0C            FE            FF           Form Feed
   0D            FE            CR           Carriage Return
   0E            GR            SO           Shift Out
   0F            GR            SI           Shift In

   10            CC            DLE          Data Link Escape
   11            DC            DC1          Device Control 1
   12            DC            DC2          Device Control 2
   13            DC            TM/DC3       Tape Mark/Device Control 3
   14            DC            RES          Restore
   15            FE            NL           New Line
   16            FE            BS           Backspace
   17            DC            IL           Idle
   18            GR            CAN          Cancel
   19            DC            EM           End of Medium
   1A            DC            CC           Cursor Control
   1B            CU            CU1          Customer Use 1
   1C            IS            IFS          Info. Field Separator
   1D            IS            IGS          Info. Group Separator
   1E            IS            IRS          Info. Record Separator
   1F            IS            IUS          Info. Unit Separator

   20            ED            DS           Digit Select
   21            ED            SOS          Start of Significance
   22            ED            FS           Field Separator
   23                                       (Reserved)
   24            DC            BYP          Bypass
   25            FE            LF           Line Feed
   26            CC            ETB          End of Transmission Block
   27            GR            ESC          Escape
   28                                       (Reserved)
   29                                       (Reserved)
   2A            DC            SM           Set Mode
   2B            CU            CU2          Customer Use 2
   2C                                       (Reserved)
   2D            CC            ENQ          Enquiry
   2E            CC            ACK          Acknowledge
   2F            DC            BEL          Bell

   30                                       (Reserved)
   31                                       (Reserved)
   32            CC            SYN          Synchronous Idle
   33                                       (Reserved)
   34            DC            PN           Punch On
   35            DC            RS           Reader Stop
   36            GR            UC           Upper Case
   37            CC            EOT          End of Transmission
   38                                       (Reserved)
   39                                       (Reserved)
   3A                                       (Reserved)
   3B            CU            CU3          Customer Use 3
   3C            DC            DC4          Device Control 4
   3D            CC            NAK          Negative Acknowledge
   3E                                       (Reserved)
   3F            GR            SUB          Substitute

                    Figure 3: EBCDIC Control Functions
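
   As noted in the Introduction, each ASCII control has a
   corresponding EBCDIC control.  The Python sketch below pairs them
   by mnemonic, using the EBCDIC codes of Figure 3.  The names are
   illustrative, and the pairing of the ASCII separators FS, GS, RS,
   and US with IFS, IGS, IRS, and IUS is an assumption.  Space is
   omitted because Figure 3 does not list it.

```python
# ASCII control mnemonics and their 7-bit codes.
ASCII_CONTROLS = {
    "NUL": 0x00, "SOH": 0x01, "STX": 0x02, "ETX": 0x03, "EOT": 0x04,
    "ENQ": 0x05, "ACK": 0x06, "BEL": 0x07, "BS": 0x08, "HT": 0x09,
    "LF": 0x0A, "VT": 0x0B, "FF": 0x0C, "CR": 0x0D, "SO": 0x0E,
    "SI": 0x0F, "DLE": 0x10, "DC1": 0x11, "DC2": 0x12, "DC3": 0x13,
    "DC4": 0x14, "NAK": 0x15, "SYN": 0x16, "ETB": 0x17, "CAN": 0x18,
    "EM": 0x19, "SUB": 0x1A, "ESC": 0x1B, "FS": 0x1C, "GS": 0x1D,
    "RS": 0x1E, "US": 0x1F, "DEL": 0x7F,
}

# EBCDIC codes from Figure 3 for the controls that have an ASCII namesake.
EBCDIC_CONTROLS = {
    "NUL": 0x00, "SOH": 0x01, "STX": 0x02, "ETX": 0x03, "HT": 0x05,
    "DEL": 0x07, "VT": 0x0B, "FF": 0x0C, "CR": 0x0D, "SO": 0x0E,
    "SI": 0x0F, "DLE": 0x10, "DC1": 0x11, "DC2": 0x12, "DC3": 0x13,
    "BS": 0x16, "CAN": 0x18, "EM": 0x19, "IFS": 0x1C, "IGS": 0x1D,
    "IRS": 0x1E, "IUS": 0x1F, "LF": 0x25, "ETB": 0x26, "ESC": 0x27,
    "ENQ": 0x2D, "ACK": 0x2E, "BEL": 0x2F, "SYN": 0x32, "EOT": 0x37,
    "DC4": 0x3C, "NAK": 0x3D, "SUB": 0x3F,
}

# Assumption: the ASCII separators pair with the EBCDIC information
# separators, not with EBCDIC FS (Field Separator, X'22') or
# RS (Reader Stop, X'35'), which share the same mnemonics.
ALIASES = {"FS": "IFS", "GS": "IGS", "RS": "IRS", "US": "IUS"}

ASCII_TO_EBCDIC_CONTROL = {
    code: EBCDIC_CONTROLS[ALIASES.get(name, name)]
    for name, code in ASCII_CONTROLS.items()
}


def translate_control(ascii_code: int) -> int:
    """Return the EBCDIC code for an ASCII control code."""
    return ASCII_TO_EBCDIC_CONTROL[ascii_code]
```

   Matching by mnemonic alone would be unsafe for FS and RS, which is
   why the alias table is applied before the lookup.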

   CC    (Communication Control). A functional character intended to
         control or facilitate transmission of information over
         communication networks.

   FE    (Format Effector). A functional character which controls the
         layout or positioning of information in printing or display
         devices.

   IS    (Information Separator). A character which is used to separate
         and qualify information in a logical sense. There is a group of
         four such characters, which are to be used in a hierarchical
         order.

   DC    (Device Control). A functional character used for the control
         of ancillary devices associated with data processing or
         telecommunication systems, more especially switching devices
         "on" and "off".

   ED    (Edit and Mark). A control character used by the
         System/[illegible]...and Mark ([illegible]) instruction for the
         formatting of alphanumeric fields.

   GR    (Graphic Control). A control character indicating that the code
         combinations which follow are to be interpreted in a particular
         code table, depending upon the particular control character.

   CU    (Customer Use). A character excluded from future assignment by
         IBM.  These "protected" codes are intended for use by customer
         systems so that their use will not conflict with a possible
         future IBM use.

                                 Figure 4
                      Categories of Control Functions

      Hex Code             Graphic                 Name
   *   6A                  |                       Vertical Line
   *   79                  `                       Grave Accent
   *   A1                  ~                       Tilde
   *   C0                  {                       Opening Brace
       CC                  [hook]                  Hook
       CE                  [fork]                  Fork
   *   D0                  }                       Closing Brace
   *   E0                  \                       Reverse Slant
       EC                  [chair]                 Chair
       FA                  |                       Long Vertical Line
       FF                  EO                      Eight Ones

                     Figure 5: Special EBCDIC Graphics

   *ASCII Graphic

        Hex Code   Graphic                 Name
           A0      -                       Superscript Minus
   **      A1      [degree]                Degree
   *       B0      [superscript 0]         Superscript Zero
   *       B1      [superscript 1]         Superscript One
   *       B2      [superscript 2]         Superscript Two
   *       B3      [superscript 3]         Superscript Three
           B4      [superscript 4]         Superscript Four
           B5      [superscript 5]         Superscript Five
           B6      [superscript 6]         Superscript Six
           B7      [superscript 7]         Superscript Seven
           B8      [superscript 8]         Superscript Eight
           B9      [superscript 9]         Superscript Nine

           8B      {                       Left Brace
           8C      [equal or less than]    Equal or Less Than
           8D      [superscript (]         Superscript Left Parenthesis
           8E      [superscript +]         Superscript Plus Sign
           8F      +                       Plotting Cross
           9B      }                       Right Brace
           9C      [lozenge]               Lozenge
           9D      [superscript )]         Superscript Right Parenthesis
           9E      [plus or minus]         Plus or Minus
           9F      [histogram]             Histogram
           AB      [lower left corner]     Lower Left Corner
           AC      [upper left corner]     Upper Left Corner
           AD      [                       Open Square Bracket
           AE      [equal or greater than] Equal or Greater Than
           AF      [bullet]                Bullet (Plotting Circle)
           BB      [lower right corner]    Lower Right Corner
           BC      [upper right corner]    Upper Right Corner
           BD      ]                       Close Square Bracket
           BE      [not equal]             Not Equal
           BF      ---                     Extended Dash

                Figure 6: Publishing and Printing Graphics
                          Also on the TN Print Chain

   **      Dual with the special EBCDIC graphic tilde
   *       Dual with another graphic which is not on the TN print chain

       Hex Code    Graphic                 Name
           70      [Scandinavian accent]   Scandinavian Accent
           71      [circumflex]            Circumflex
           72      [diaeresis]             Diaeresis
           73      /                       Diacritical Virgule
           74      '                       Acute Accent
           75      [superior .]            Superior Dot
           76      ,                       Cedilla
           77      [breve]                 Breve
           78      [caron]                 Caron
           8A      [up arrow]              Up Arrow
           9A      [dagger]                Dagger

   *       B0      "                       Double Acute
   *       B1      '                       Inferior Hook
   *       B2      -                       Macron
   *       B3      ,                       Inferior Comma

           CD      '                       Open Quote
           DB      [pound sign]            Pound Sign
           DC      [section sign]          Section Sign
           DD      [paragraph sign]        Paragraph Sign
           ED      '                       Close Quote

             Figure 7: Publishing and Printing Graphics
                       not on the TN Print Chain

   *       Dual with another graphic which is on the TN print chain

   Name            Graphic  Hex Code  Graphic          Name
   Tilde              ~        A1     [degree]         Degree
   Double Acute       "        B0     [superscript 0]  Superscript Zero
   Inferior Hook      '        B1     [superscript 1]  Superscript One
   Macron             -        B2     [superscript 2]  Superscript Two
   Inferior Comma     ,        B3     [superscript 3]  Superscript Three

                      Figure 8: Graphic Duals
   __________________________________________________________________

   Codes           Graphics               Name
   AF--75             .                   Bullet--Superior Dot
   8B--C0             {                   Left Brace--Opening Brace
   9B--D0             }                   Right Brace--Closing Brace
   61--73             /                   Slash--Diacritical Virgule
   A1--70          [degree]               Degree--Scandinavian Accent
   4F--FA             |                   Logical Or--Long Vertical Mark
   6B--76--B3         ,                   Comma--Cedilla--Inferior Comma
   60--B2             -                   Dash--Macron

                       Figure 9: Similar Graphics
   ____________________________________________________________________

   Name            Control  Hex Code  Graphic      Name
   Group Mark        GM      4F          |         Logical Or
   Mode Change       MC      5F      [upper right corner] Logical Not
   Plus Zero         PZ      C0          {         Opening Brace
   Minus Zero        MZ      D0          }         Closing Brace
   Record Mark       RM      E0          \        Reverse Slant
   Word Separator    WS      6D          __        Underscore
   Segment Mark      SM      6F           ?        Question Mark
   Substitute Blank  SB      7A           :        Colon
   Tape Mark         TM      7F           "        Quotation Marks

                     Figure 10: Graphic Control Duals
   ____________________________________________________________________

   This material has not been reviewed for public release and is
   intended only for use with the ARPA network.  It should not be quoted
   or cited in any publication not related to the ARPA network.
