Copyright Digital Equipment Corp. All rights reserved.

Qualifier_Values

 

SEQUENCE=sequence_type
 

ASCII

   Specifies ASCII collating sequence, which is the default
   sequence.
 

EBCDIC

   Arranges characters according to EBCDIC sequence. The characters
   remain in ASCII representation; only the order is changed.
 

MULTINATIONAL

   Arranges characters according to Multinational sequence, which
   collates the international character set. When you use the
   Multinational sequence, characters are ordered according to the
   following rules:

   o  All diacritical forms of a character are given the collating
      value of the character (A',A",A` collate as A).

   o  Lowercase characters are given the collating value of their
      uppercase equivalents (a collates as A, a" collates as A").

   o  If two strings compare as equal, tie-breaking is performed.
      The strings are compared to detect differences due to
      diacritical marks, ignored characters, or characters that
      collate as equal although they are actually different. If the
      strings still compare as equal, another comparison is done
      based on the numeric codes of the characters. In this final
      comparison, lowercase characters are ordered before uppercase.

   Care should be taken when sorting or merging files for further
   processing using the Multinational sequence. Sequence checking
   procedures in most programming languages compare numeric
   characters. Because Multinational is based on actual graphic
   characters and not on the codes representing those characters,
   normal sequence checking does not work.
 

user-defined-sequence

   Specifies a user-defined collating sequence. Define a collating
   sequence by specifying a string of single or double characters
   or ranges of single characters. (A double character is any set
   of two single characters collated as if they were one character.
   For example, "CH" can be defined to collate as "C".) This string
   should be enclosed in parentheses.

   You can also represent characters by their corresponding octal,
   decimal, or hexadecimal values using the radix operators: %O, %D,
   %X.

   You must observe the following rules when defining your collating
   sequence:

   o  Enclose characters in quotation marks (" ").

   o  Separate each character and character range with a comma, and
      enclose the entire list in parentheses.

   o  Give all the characters appearing in the character keys in
      the sort or merge operation a collating value. Any character
      not given a collating value will be ignored unless the FOLD or
      MODIFICATION options are specified.

   o  Do not define a character more than once.

   o  Do not specify the null character by using quotation marks
      (""). Instead, use a radix operator such as %X0.

   o  Specify quotation marks by enclosing them within another set
      of quotation marks ("" "") or by using a radix operator.
 

MODIFICATION=(character operator character)

   Specifies a change to the collating sequence specified in the
   SEQUENCE option. You can modify the ASCII, EBCDIC, Multinational,
   or user-defined sequence. The sequence being modified must be
   specified with the SEQUENCE qualifier even if the sequence is the
   default (ASCII).
 

character

   Specifies a character in the collating sequence. You can specify
   a single or double character. A double character is any set
   of two single characters collated as if they were a single
   character. Enclose the character in quotation marks.
 

operator

   Specifies the operator used to compare the characters. You can
   specify greater than (>), less than (<), or equal to (=).

   These are the kinds of changes permitted in the MODIFICATION
   option:

   o  A single or double character can be equated to a single
      character that has already been assigned a collating value
      ("a"="A").

   o  A single or double character can collate after a single
      character that has already been assigned a collating value
      ("CH">"C").

   o  A single or double character can collate before a single
      character that has already been assigned a collating value
      ("D"<"A").

   o  A double character can be equated to a previously defined
      double character ("CH" = "SH").

   o  A single character can be equated to a double character
      sequence ("C" = "CH").
 

IGNORE

   Specifies that Sort/Merge ignore a character or character range
   in the collating sequence when making an initial comparison.
   Note that, when tie-breaking takes place, Sort/Merge considers
   the characters specified with the IGNORE qualifier. Tie-breaking
   takes place when two or more strings have compared as equal and
   the Multinational sequence is being used or when two or more
   strings have compared as equal and the TIE_BREAK qualifier has
   been specified.
 

FOLD

   Specifies that all lowercase letters be given the collating value
   of their uppercase equivalents. For ASCII, EBCDIC, and user-
   defined sequences, the lowercase letters are a to z.

   Because the lowercase letters in the Multinational sequence
   already have the collating value of their uppercase equivalents,
   using FOLD is unnecessary.
 

TIE_BREAK

   Specifies whether or not Sort/Merge should use numeric values to
   break any ties between characters that have equivalent values.
   By default, tie-breaking occurs with the Multinational sequence.
   Specifying NOTIE_BREAK overrides this default and ensures that no
   further comparisons are made after the initial comparison.

   A TIE_BREAK option must be specified for the ASCII, EBCDIC, and
   user-defined sequences in order for tie-breaking to occur. TIE_
   BREAK should be used when specifying FOLD or MODIFICATION for the
   these sequences.