Compaq Fortran
Release Notes for Compaq Tru64 UNIX Systems

Contents

1.12 Additional Information

This section contains information that supplements the Compaq Fortran documentation.

1.12.1 The Compaq Fortran Home Page

If you have Internet access and a World Wide Web (WWW) viewer, you are welcome to view the following:

The Compaq Fortran home page, located at the following URL:
http://www.compaq.com/fortran
The Compaq Computer Corporation home page, located at the following URL:
http://www.compaq.com

1.12.2 Support for the Fortran 95 Standard Features

This section briefly describes the Fortran 95 language features that have been added to Compaq Fortran:

The FORALL statement and construct
In Fortran 95/90, you could build array values element-by-element by using array constructors and the RESHAPE and SPREAD intrinsics. The Fortran 95 FORALL statement and construct offer an alternative method.
FORALL allows array elements, array sections, character substrings, or pointer targets to be explicitly specified as a function of the element subscripts. A FORALL construct allows several array assignments to share the same element subscript control.
FORALL is a generalization of WHERE. They both allow masked array assignment, but FORALL uses element subscripts, while WHERE uses the whole array.
Compaq Fortran previously provided the FORALL statement and construct as language extensions.
PURE procedures
Pure user-defined procedures do not have side effects, such as changing the value of a variable in a common block. To specify a pure procedure, use the PURE prefix in the function or subroutine statement. Pure functions are allowed in specification statements.
Compaq Fortran previously allowed pure procedures as a language extension.
ELEMENTAL procedures
An elemental user-defined procedure is a restricted form of pure procedure. An elemental procedure can be passed an array, which is acted upon one element at a time. To specify an elemental procedure, use the ELEMENTAL prefix in the function or subroutine statement.
Pointer initialization
In Fortran 95/90, there was no way to define the initial value of a pointer or to assign a null value to the pointer by using a pointer assignment operation. A Fortran 95/90 pointer had to be explicitly allocated, nullified, or associated with a target during execution before association status could be determined.
Fortran 95 provides the NULL intrinsic function that can be used to nullify a pointer.
Derived-type structure default initialization
Fortran 95 lets you specify, in derived-type definitions, default initial values for derived-type components.
Automatic deallocation of allocatable arrays
Arrays declared using the ALLOCATABLE attribute can now be automatically deallocated in cases where Fortran 95/90 would have assigned them undefined allocation status.
Compaq Fortran previously provided this feature as a language extension.
CPU_TIME intrinsic subroutine
This new intrinsic subroutine returns a processor-dependent approximation of processor time.
Enhanced CEILING and FLOOR intrinsic functions
KIND can now be specified for these intrinsic functions.
Enhanced MAXLOC and MINLOC intrinsic functions
DIM can now be specified for the MAXLOC and MINLOC intrinsic functions. DIM was allowed previously as a Compaq Fortran language extension.
Enhanced SIGN intrinsic function
The SIGN function can now distinguish between positive and negative zero (if the processor is capable of doing so).
Enhanced WHERE construct
The WHERE construct has been improved to allow nested WHERE constructs and a masked ELSEWHERE statement. WHERE constructs can now be named.
Comments allowed in namelist input
Fortran 95 allows comments (beginning with !) in namelist input data. Compaq Fortran previously allowed this as a language extension.
Generic identifier to END INTERFACE statement
The END INTERFACE statement of an interface block defining a generic routine now allows a generic identifier.
Zero-length formats
On output, when using I, B, O, Z and F edit descriptors, the specified value of the field width can be zero (0). In such cases, the compiler selects the smallest possible positive actual field width that does not result in the field being filled with asterisks.
New obsolescent features
Fortran 95 deletes several language features that were obsolescent in Fortran 90, and identifies new obsolescent features:
- REAL and DOUBLE PRECISION DO variables
- Branching to an ENDIF from outside its IF
- PAUSE statement
- ASSIGN statement, assigned GOTO, and assigned FORMATs
- H edit descriptor
Compaq Fortran flags these deleted and obsolescent features, but fully supports them.

1.12.3 Preliminary Information on Support for Big Objects

Big objects are data items whose size cannot be represented by a signed 32 bit integer. Compaq Fortran supports larger objects than Compaq Fortran 77.

Big objects are good for massive machines and clusters used for numerical analysis, such as weather forecasting and high energy physics problems. Both special knowledge and very large hardware configurations are needed to use this feature.

Your system and its operating system must be configured to:

Allow a very large stack space.
Allow a very large data space.
Allow large values for parameters, such as vm-maxvas.
Unless huge amounts of physical memory are present, enable lazy swapping.
Check the size of page/swap files and create larger files if needed.

For more information, see the Compaq Tru64 UNIX system management documentation. For Compaq Tru64 UNIX Version 4.0, you can use the following check list:

Either have a large swap space or use deferred swap allocation. This involves either:
- Have more swap space than the address space used by the largest program you want to run. The following command shows swap allocation:
  $ /usr/sbin/swapon -s
- Use the deferred mode of swap allocation. The following command displays the reference (man) page for swapon, which describes how to change the swap allocation
  $ man swapon

Reconfigure the UNIX kernel (for Version 4.0 or later) to change the following parameters as desired. For example, on one system, all values were set to 16 GB:

Parameter	Explanation
max-per-proc-address-space	Largest address space
max-per-proc-data-size	Largest data size
max-per-proc-stack-size	Largest stack size
vm-maxvas	Largest virtual-memory

Also set the following per-process values:

Parameter	Explanation
per-proc-address-space	Default address space
per-proc-data-size	Default data size
per-proc-stack-size	Default stack size

The per-process limits can be checked and increased with the limit or ulimit commands.

You can create big objects as static data, automatic data (stack), or dynamically allocated data (ALLOCATE statement or other means).

The address space limitations depends on the Alpha processor generation in use:

Address space for ev4 Alpha generation processors is 2**42
Address space for ev5 Alpha generation processors is 2**46

Although the compiler produces code that computes 63-bit signed addresses, objects and addresses larger than the hardware limitations will not work.

Limitations of using big objects include:

Initializing big objects by using a DATA statement or a TYPE declaration is not supported.
Big objects cannot be passed by value as program arguments.
Debug support for big objects is limited.
I/O of entire big objects is not supported, but I/O of parts of an array should work.

The following small example program allocates a big character object:

character xx(2_8**31+100_8) integer*8 i i = 10 xx(i) = 'A' i = 2_8**31 + 100_8 xx(i) = 'B' print *,xx(10_8) print *,xx(i) end

1.12.4 New Random Number Algorithm

A new random_number intrinsic (Version 4.0 or later) uses a different algorithm than the one previously used.

The test program below shows the use of the random_seed and random_number intrinsics.

program testrand intrinsic random_seed, random_number integer size, seed(2), gseed(2), hiseed(2), zseed(2) real harvest(10) data seed /123456789, 987654321/ data hiseed /-1, -1/ data zseed /0, 0/ call random_seed(SIZE=size) print *,"size ",size call random_seed(PUT=hiseed(1:size)) call random_seed(GET=gseed(1:size)) print *,"hiseed gseed", hiseed, gseed call random_seed(PUT=zseed(1:size)) call random_seed(GET=gseed(1:size)) print *,"zseed gseed ", zseed, gseed call random_seed(PUT=seed(1:size)) call random_seed(GET=gseed(1:size)) call random_number(HARVEST=harvest) print *, "seed gseed ", seed, gseed print *, "harvest" print *, harvest call random_seed(GET=gseed(1:size)) print *,"gseed after harvest ", gseed end program testrand

When executed, the program produces the following output:

% testrand size 2 hiseed gseed -1 -1 171 499 zseed gseed 0 0 2147483562 2147483398 seed gseed 123456789 987654321 123456789 987654321 harvest 0.6099895 0.9807594 0.2936640 0.9100146 0.8464803 0.4358687 2.5444610E-02 0.5457680 0.6483381 0.3045360 gseed after harvest 375533067 1869030476

1.12.5 Compaq Fortran 77 Pointers

Compaq Fortran 77 pointers are CRAY® style pointers, an extension to the Fortran 90 standard. The POINTER statement establishes pairs of variables and pointers, as described in the Compaq Fortran Language Reference Manual.

1.12.6 Extended Precision REAL (KIND=16) Floating-Point Data

The X_float data type is a little endian IEEE-based format that provides extended precision. It supports the REAL*16 Compaq Fortran Q intrinsic procedures. For example, the QCOS intrinsic procedure for the generic COS intrinsic procedure.

The value of REAL (KIND=16) data is in the approximate range: 6.475175119438025110924438958227647Q-4966 to 1.189731495357231765085759326628007Q4932.

Unlike other floating-point formats, there is little if any performance penalty from using denormalized extended-precision numbers, since accessing denormalized numbers do not result in an arithmetic trap (extended-precision is emulated in software). (The smallest normalized number is 3.362103143112093506262677817321753Q-4932.)

The precision is approximately one part in 2**112 or typically 33 decimal digits.

The X_float format is emulated in software. Although there is no standard IEEE little endian 16-byte REAL data type, the X_float format supports IEEE exceptional values.

For more information, see the revised Compaq Fortran User Manual for Tru64 UNIX and Linux Alpha Systems and the Compaq Fortran Language Reference Manual.

1.12.7 Variable Format Expressions (VFEs)

By enclosing an arithmetic expression in angle brackets, you can use it in a FORMAT statement wherever you can use an integer (except as the specification of the number of characters in the H field). For example:

J = 5 FORMAT (I<J+1>)

For more information, see the Compaq Fortran Language Reference Manual.

1.12.8 Notes on Debugger Support

Compaq Tru64 UNIX provides both the dbx and the Compaq Ladebug (formerly DECladebug) debuggers in the programming environment subsets.

These debuggers are very similar and use almost identical set of commands and command syntax. Both have a command-line interface as well as a Motif® windowing interface.

A character-cell Ladebug (ladebug) interface is provided with Ladebug in the Compaq Tru64 UNIX operating system Programmer's Development Toolkit. To use the character-cell interface, use the ladebug command.

When using Ladebug with certain versions of the UNIX operating system, be aware that a trailing underscore may be needed to display module variables. For example, to display variable X in module MOD, type:

print $MOD$X$_

The Parallel Software Environment supports debugging parallel HPF programs (see the DIGITAL High Performance Fortran 90 HPF and PSE Manual). This section addresses scalar (nonparallel) debugging.

When using the f90 command to create a program to be debugged using dbx or ladebug , consider using the following options:

Specify -g or -g2 to request symbol table and traceback information needed for debugging.
Avoid requesting optimization. When you specify -g or -g2 , the optimization level is set to -o0 by default. Debugging optimized code is neither easy nor recommended.
When using the Ladebug debugger, you should specify the -ladebug option. The -ladebug option allows you to print and assign to dynamic arrays using standard Fortran syntax.

For example, the following command creates the executable program proj_dbg.out for debugging with Ladebug:

% f90 -g -ladebug -o proj_dbg.out file.f90

You invoke the character-cell Ladebug debugger by using the ladebug command.

For more information, see the debugger chapter in the revised Compaq Fortran User Manual for Tru64 UNIX and Linux Alpha Systems (Chapter 4).

1.12.8.1 Ladebug Debugger Support Notes

The following improvements in Ladebug support for the Compaq Fortran language were added for DIGITAL UNIX Version 4.0:

Ladebug now includes a graphical window interface.
Ladebug now supports the display of array sections.
Ladebug now displays Fortran data types using Fortran 90 name syntax rather than C names (such as integer rather than int).
Ladebug provides improved support for debugging mixed-language C and Fortran applications.
These and other improvements are described in the debugger chapter (Chapter 4) of the Compaq Fortran User Manual for Tru64 UNIX and Linux Alpha Systems.

The following improvements in Ladebug support for the Fortran 90 language were added for DEC OSF/1 Version 3.2 (DECladebug V3.0-16):

Fortran and Fortran 90 language expression evaluation is built into the Ladebug command language, including:
- Case-insensitive identifiers, variables, program names, and so on
- Logical expressions, including:
  Relational operators (.LT., .LE., .EQ., .NE., .GT., .GE.)
  Logical operators (.XOR., .AND., .OR., .EQV., .NEQV., .NOT.)
Fortran 90 pointers
Fortran 90 array support, including:
- Explicit-shape arrays
- Assumed-shape arrays
- Automatic arrays
- Assumed-size arrays
- Deferred-shape arrays
COMMON block support, including:
- Display of whole common block
- Display of (optionally-qualified) common block members
COMPLEX variable support, including the display, assignment, and use of arithmetic expressions involving COMPLEX variables
Alternate entry points, including breakpoints, tracepoints, and stack tracing ( where command)
Mixed-language debugging

For more information on using Ladebug, see the debugger chapter in the revised Compaq Fortran User Manual for Tru64 UNIX and Linux Alpha Systems (Chapter 4).

1.12.8.2 dbx Debugger Support Notes

When using dbx with Compaq Fortran programs, certain differences exist. For example, in dbx , assumed-shape arguments, allocatable arrays, and pointers to arrays are printed as a derived type. Consider the following program:

module foo real x contains subroutine bar(a) integer a(:) a(1) = 1 end subroutine bar end module foo use foo integer b(100) call bar(b) end

If the above program were stopped inside BAR, the following would occur:

(dbx) print a common / dim = 1 element_length = 4 ptr = 0x140000244 ies1 = 4 ub1 = 10 lb1 = 1 /

The meaning of the fields are:

dim - dimension of the object
element_length - the length of each element in bytes
ptr - the address of the object
iesn - distance (in bytes) between elements in the nth dimension
ubn - upper bound in the nth dimension
lbn - lower bound in the nth dimension

1.12.9 Notes on Fast Math Library Routines

The f90 option -math_library fast provides alternate math routine entry points to the following:

SQRT, EXP, LOG, LOG10, SIN, and COS intrinsic procedures
Power (**) in arithmetic expressions

1.12.10 The Compaq Fortran Array Descriptor Format

In the Compaq Fortran User Manual for Tru64 UNIX and Linux Alpha Systems, Chapter 10, Section 10.1.7 describes the Compaq Fortran array descriptor format.

These notes are an initial attempt to provide a template for those C programmers creating an a .h file that lays out the Fortran array descriptor format.

There are two varying parameters for this descriptor format:

The element type (shown in this section as <ELEMENT_TYPE> )
The rank (shown in this section as <RANK> )

Common information for all descriptors is the general layout of the header and the information for each dimension.

One possible C @codefont(struct) definition for the per-dimension information is:

struct _f90_array_dim_info { int inter_element_spacing; int pad1; int upper_bound; int pad2; int lower_bound; int pad3; };

The inter-element spacing is measured in 8-bit bytes, not in array elements. This presents a challenge in designing array descriptor definitions in C, since there is no completely clean way to interact with C's pointer arithmetic.

One way to design the struct definition for an array descriptor is to use the template:

struct _f90_array_desc_rank<RANK>_<NAME_TOKEN> { unsigned char dim; unsigned char flags; unsigned char dtype; unsigned char class; int pad; long length; <ELEMENT_TYPE> * pointer; long arrsize; void * addr_a0; struct _f90_array_dim_info dim_info[<RANK>]; };

Where <RANK>, <NAME_TOKEN> and <ELEMENT_TYPE> are the template parameters. Often <NAME_TOKEN> and <ELEMENT_TYPE> can be the same, but in cases where <ELEMENT_TYPE> has non-identifier characters in it (for example, space or star) then a suitable <NAME_TOKEN> should be devised.

The problem with this approach is that the element addressing, which uses the inter-element spacing, generates an offset in bytes. In order to use C's native pointer arithmetic, either casts need to be done or a division. For example:

Casting:

*((<ELEMENT_TYPE> *) (((char *) desc->pointer) + byte_offset))

Division:

(desc->pointer)[byte_offset/sizeof(<ELEMENT_TYPE>)]

Another way to design the struct definition for an array descriptor is to use the template:

struct _f90_array_desc_rank<RANK>_general { unsigned char dim; unsigned char flags; unsigned char dtype; unsigned char class; int pad; long length; char * pointer; long arrsize; void * addr_a0; struct _f90_array_dim_info dim_info[<RANK>]; };

An advantage to this approach is that the same definition can be used for all arrays of the same rank. The problem with this approach is that it forces the programmer to cast:

*((<ELEMENT_TYPE> *) (desc->pointer + byte_offset))

Another approach is to remove <RANK> from the template as well, yielding:

struct _f90_array_desc_general { unsigned char dim; unsigned char flags; unsigned char dtype; unsigned char class; int pad; long length; char * pointer; long arrsize; void * addr_a0; struct _f90_array_dim_info dim_info[7]; };

On the last line, 7 is used since that is the maximum rank allowed by Fortran. Since the dim field should be checked, this definition can be used in many (perhaps most) of the places a rank-specific definition would be used, provided the programmer is aware that the dim_info fields beyond the actual rank are undefined.

One place such a definition should NOT be used is when an object of this definition is used as part of an assignment. This usage is considered rare. For example:

void ptr_assign_buggy(struct _f90_array_desc_general * ptr, struct _f90_array_desc_general * tgt) { *ptr = *tgt; }

Example of Array Descriptor Format Use

In this example, we have a 'struct tree' and a procedure prune_some_trees_() that takes a descriptor of a rank=3 array of such structs and calls prune_one_tree_() on each individual tree (by reference):

void prune_some_trees(struct _f90_array_desc_general * trees) { if (trees->dim != 3) { raise_an_error(); return; } else { int x,y,z; int xmin = trees->dim_info[0].lower_bound; int xmax = trees->dim_info[0].upper_bound; int xstp = trees->dim_info[0].inter_element_spacing; int ymin = trees->dim_info[1].lower_bound; int ymax = trees->dim_info[1].upper_bound; int ystp = trees->dim_info[1].inter_element_spacing; int zmin = trees->dim_info[2].lower_bound; int zmax = trees->dim_info[2].upper_bound; int zstp = trees->dim_info[2].inter_element_spacing; int xoffset,yoffset,zoffset; for (z = zmin, zoffset = 0; z <= zmax; z+= 1, zoffset += zstp) { for (y = ymin, yoffset = 0; y <= ymax; y+= 1, yoffset += ystp) { for (x = xmin, xoffset = 0; x <= xmax; x+= 1, xoffset += xstp) { struct tree * this_tree = (struct tree *) (trees->pointer + xoffset+yoffset+zoffset); prune_one_tree_(this_tree); } } } } }

Compaq would appreciate feedback on which definitions of array descriptors users have found most useful.

Note that the format for array descriptors used by HPF is more complicated and is not described at this time.

Chapter 2
New Features for Compaq Fortran Versions 4.n, 2.0, and 1.n Releases

This chapter summarizes the new features for Compaq Fortran Versions prior to Version 5.0:

New Features and Corrections in Version 4.1 ( Section 2.1)
New Features in Version 4.0 ( Section 2.2)
New Features in Version 2.0 ( Section 2.3)
New Features in Version 1.3 ( Section 2.4)
New Features in Version 1.2 ( Section 2.5)
New Features in Version 1.1 ( Section 2.6)

2.1 New Features and Corrections in Version 4.1

Version 4.1 is a maintenance release that contains a limited number of new features and corrections to problems discovered since Version 4.0 was released.

For additional information added to these release notes for Version 4.1, see Section 1.12.3.

The following new features have been added for DIGITAL Fortran 90 Version 4.1:

This release includes a partial implementation of the proposed Fortran 95 standard.
The following features of the proposed Fortran 95 standard have been implemented by this version of DIGITAL Fortran 90 and are supported when using the f90 or f95 commands:
- FORALL statement and construct (implemented prior to Version 4.1)
- Automatic deallocation of ALLOCATABLE arrays (implemented prior to Version 4.1)
- Dim argument to MAXLOC and MINLOC (implemented prior to Version 4.1)
- PURE user-defined subprograms (implemented prior to Version 4.1)
- ELEMENTAL user-defined subprograms (a restricted form of a pure procedure)
- Pointer initialization (initial value)
- The NULL intrinsic to nullify a pointer
- Derived-type structure initialization
- CPU_TIME intrinsic subroutine
- Kind argument to CEILING and FLOOR intriniscs
- Enhanced SIGN intrinsic function
- Nested WHERE constructs, masked ELSEWHERE statement, and named WHERE constructs
- Comments allowed in namelist input
- Generic identifier in END INTERFACE statements
- Detection of Obsolescent and/or Deleted features listed in the proposed Fortran 95 standard
For more information on Fortran 95 features, see the Section 1.12.2.
The f95 command is now available for use with the -std option:
- To perform standards conformance checking against the Fortran 90 standard, use the f90 command with -std . Using f90 with -std will issue messages for features (such as FORALL) that have recently been added to the proposed Fortran 95 standard (as well as other extensions to the Fortran 90 standard).
- To perform standards conformance checking against the Fortran 95 standard, use the f95 command with -std . Using f95 with -std will not issue messages for features (such as FORALL) that have been added to the proposed Fortran 95 standard, but will issue messages for extensions to the Fortran 95 standard.
For more information on Fortran 95 features, see the Section 1.12.2.
The -intconstant option has been added.
Specify the -intconstant option to use DIGITAL Fortran 77 rather than Fortran 90 semantics to determine kind of integer constants. If you do not specify -intconstant , Fortran 90 semantics are used.
Fortran 77 semantics require that all constants are kept internally by the compiler in the highest precision possible. For example, if you specify -intconstant , an integer constant of 14 will be stored internally as INTEGER(KIND=8) and converted by the compiler upon reference to the corresponding proper size. Fortran 90 specifies that integer constants with not explicit KIND are kept internally in the default INTEGER kind (KIND=4 by default).
Similarly, the internal precision for floating-point constants is controlled by the -fpconstant option.
The -pad_source option has been added.
Specify the -pad_source option to request that source records shorter than the statement field width are to be padded with spaces on the right out to the end of the statement field. This affects the interpretation of character and Hollerith literals that are continued across source records.
The default is -nopad_source , which causes a warning message to be displayed if a character or Hollerith literal that ends before the statement field ends is continued onto the next source record. To suppress this warning message, specify the -warn nousage option.
Specifying -pad_source can prevent warning messages associated with -warn usage .
The -warn usage option has been added.
Specify the -warn nousage option to suppress warning messages about questionable programming practices which, although allowed, often are the result of programming errors. For example, a continued character or Hollerith literal whose first part ends before the statement field ends and appears to end with trailing spaces is detected and reported by -warn usage .
The default is -warn usage .
The -arch keyword option has been added.
This option determines the type of Alpha chip code that will be generated for this program. The -arch keyword option uses the same keywords as the -tune keyword option.
Whereas the -tune keyword option primarily applies to certain higher-level optimizations for instruction scheduling purposes, the -arch keyword option determines the type of code instructions generated for the program unit being compiled.
DIGITAL UNIX Version 4.0 and subsequent releases provide an operating system kernel that includes an instruction emulator. This emulator allows new instructions, not implemented on the host processor chip, to execute and produce correct results. Applications using emulated instructions will run correctly, but may incur significant software emulation overhead at runtime.
All Alpha processors implement a core set of instructions. Certain Alpha processor versions include additional instruction extensions.
Supported -arch keywords are as follows:
- -arch generic generates code that is appropriate for all Alpha processor generations. This is the default.
  Running programs compiled with the generic keyword will run all implementations of the Alpha architecture without any instruction emulation overhead.
- -arch host generates code for the processor generation in use on the system being used for compilation.
  Running programs compiled with this keyword on other implementations of the Alpha architecture might encounter instruction emulation overhead.
- -arch ev4 generates code for the 21064, 21064A, 21066, and 21068 implementations of the Alpha architecture.
  Running programs compiled with the ev4 keyword will run without instruction emulation overhead on all Alpha processors.
- -arch ev5 generates code for some 21164 chip implementations of the Alpha architecture that use only the base set of Alpha instructions (no extensions).
  Running programs compiled with the ev5 keyword will run without instruction emulation overhead on all Alpha processors.
- -arch ev56 generates code for some 21164 chip implementations that use the byte and word manipulation instruction extensions of the Alpha architecture.
  Running programs compiled with the ev56 keyword might incur emulation overhead on ev4 and ev5 processors, but will still run correctly on DIGITAL UNIX Version 4.0 (or later) systems.
- -arch pca56 generates code for the 21164PC chip implementation that uses the byte and word manipulation instruction extensions and multimedia instruction extensions of the Alpha architecture.
  Running programs compiled with the pca56 keyword might incur emulation overhead on ev4 and ev5 and ev56 processors, but will still run correctly on DIGITAL UNIX Version 4.0 (or later) systems.
In addition to ev4, ev5, generic, and host, The ev56 and pca56 keywords are now supported for the -tune option.

The following new High Performance Fortran features have been added for DIGITAL Fortran 90 Version 4.1:

Transcriptive data distributions are now supported.
The INHERIT directive can now be used to inherit distributions, as well as alignments.
Distributed components of user-defined types are now handled in parallel. This is not part of standard High Performance Fortran (HPF), but is an approved extension.
The GLOBAL_SHAPE and GLOBAL_SIZE HPF Local Library routines are now supported.
There is a new compile-time option named -show hpf , which replaces the -show wsfinfo option. The -show hpf option provides performance information at compile time. Information is given about inter-processor communication, temporaries created at procedure boundaries, optimized nearest-neighbor computations, and code that is not handled in parallel. You can choose the level of detail you wish to see.
New example programs are available in the following directory:
/usr/examples/HPF

These new features are described in the DIGITAL High Performance Fortran 90 HPF and PSE Manual.

The corrections made for DIGITAL Fortran 90 Version 4.1 include the following:

Fix compiler abort with certain types of pointer assignment.
Fix incorrect error message for nested STRUCTUREs.
Fix inconsistent severity for undeclared variable message with IMPLICIT NONE or command line switch.
Fix incorrect error about passing LOGICAL*4 to a LOGICAL*1 argument.
Add standards warning for non-integer expressions in computed GOTO.
Do not flag NAME= as nonstandard in INQUIRE.
Add standards warning for AND, OR, XOR intrinsics.
VOLATILE attribute now honored for COMMON variables.
Allow COMPLEX expression in variable format expression.
Allow adjustable array to be declared AUTOMATIC (AUTOMATIC declaration is ignored.)
Honor -automatic (/RECURSIVE) in main program.
Fix incorrect parsing error when DO-loop has bounds of -32768,32767.
Fix compiler abort when extending generic intrinsic.
Fix SAVEd variable in inlined routine that didn't always get SAVEd.
Fix compiler abort with initialization of CHARACTER(LEN=0) variable
Correct values of PRECISION, DIGITS, etc. for floating types.
Fix incorrect value of INDEX with zero-length strings.
Correct value for SELECTED_REAL_KIND in PARAMETER statement.
Correct compile-time result of VERIFY.
For OpenVMS only, routine using IARGPTR or IARGCOUNT corrupts address of passed CHARACTER argument.
Standards warning for CMPLX() in initialization expression.
Fix compiler abort when %LOC(charvar) used in statement function.
Fix incorrect initialization of STRUCTURE array.
Fix compiler abort with large statement function.
RESHAPE of array with a zero bound aborts at runtime.
For OpenVMS only, /INTEGER_SIZE now correctly processed.
SIZEOF(SIZEOF()) is now 8.
Fix error parsing a derived type definition with a field name starting with "FILL_".
With OPTIONS /NOI4, compiler complained of IAND with arguments of an INTEGER*4 variable and a typeless PARAMETER constant.
Fix incorrect standards warning for DABS.
Add error message for ambiguous generic.
Corrected error parsing field array reference in IF.
Bit constants in argument lists are typed based on value, not "default integer".
Allow module to use itself.
Fix standards warning for Hollerith constant.
For OpenVMS only, FOR$RAB is always INTEGER*4.
For OpenVMS only, wrong values for TINY, HUGE for VAX floating.
For OpenVMS only, EXPONENT() with /FLOAT=D_FLOAT references non-existent math routine.
The Compaq Fortran run-time library incorrectly failed to release previously allocated memory when padding Fortran 90 input.
The Compaq Fortran run-time library would incorrectly go into an infinite loop when an embedded NULL character value was found while performing a list-directed read operation.
The Compaq Fortran run-time library would incorrectly treat an end-of-record marker as a value separator while performing a list-directed read operation.
The Compaq Fortran run-time library incorrectly produced a "recursive I/O operation" error after a user has made a call to flush() to flush a unit which was not previously opened, then attempted to perform any I/O operation on the unit
The Compaq Fortran run-time library would incorrectly fail to return an end-of-record error for certain non-advancing I/O operations. This occurred when attempting to read into successive array elements while running out of data.
The Compaq Fortran run-time library, when it encountered the ":" edit descriptor at the end of the input record did not stop reading the next record, causing errors like "input conversion".
The Compaq Fortran run-time library did not handle implied DO loops on I/O lists when non-native file support ( -convert or equivalent conversion method) was in use.

The following are corrections for HPF users in this version:

Expanded I/O Support, including support for all features, including: complicated I/O statements containing function calls, assumed size arrays, or variables of derived types with pointer components, and array inquiry intrinsics using the implied DO loop index.
In addition, non-advancing I/O (except on stdin and stdout) now works correctly if every PSE peer in the cluster has a recent version of the Fortran run-time library (fortrtl_371 or higher).
NUMBER_OF_PROCESSORS and PROCESSORS_SHAPE in EXTRINSIC(HPF_SERIAL) routines
Restriction lifted on user-defined types in some FORALLs
Declarations in the specification part of a module
EXTRINSIC(SCALAR) changed to EXTRINSIC(HPF_SERIAL)

These new corrections are described in more detail in the Parallel Software Environment (PSE) release notes.

2.2 New Features in Version 4.0

The following f90 command options were added for DIGITAL Fortran 90 Version 4.0:

Specify the -assume byterecl option to::
- Indicate that the OPEN statement RECL unit for unformatted files is in byte units. If you omit -assume byterecl , Compaq Fortran expects the OPEN statement RECL value for unformatted files to be in longword (four-byte) units.
- Return the record length value for an INQUIRE by output list (unformatted files) in byte units. If you omit -assume byterecl , Compaq Fortran returns the RECL value for an INQUIRE by output list in longword (four-byte) units.
The -check nopower option allows arithmetic calculations that result in 0**0 or a negative number raised to an integer power of type real (such as --3**3.0) to be calculated, rather than stop the program. If you omit -check nopower for such calculations, an exception occurs and the program stops (default is -check:power ).
For example, if you specified -check:nopower , the calculation of the expression 0**0 results in 1 and the expression --3**3.0 results in --9.
Specify -hpf_matmul to use matrix multiplication from the HPF library. Omitting the -hpf_matmul option uses inlined intrinsic code that is faster for small matrices. For nonparallel compilations, specifying -hpf_matmul to use the HPF library routine is faster for large matrices.
The -names keyword option controls how DIGITAL Fortran 90 handles the case-sensitivity of letters in source code identifiers and external names:
- Using -names as_is requests that Compaq Fortran distinguish between uppercase and lowercase letters in source code identifiers (treats uppercase and lowercase letters as different) and distinguish between uppercase and lowercase letters in external names.
- Using -names lowercase (default) requests that Compaq Fortran not distinguish between uppercase and lowercase letters in source code identifiers (treats lowercase and uppercase letters as equivalent) and force all letters to be lowercase in external names.
- Using -names uppercase requests that Compaq Fortran not distinguish between uppercase and lowercase letters in source code identifiers (treats lowercase and uppercase letters as equivalent) and force all letters to be uppercase in external names.
The -noinclude option prevents searching for include files in the /usr/include directory. This option does not apply to the directories searched for module files or cpp files.
The -o5 option activates both the software pipelining optimization ( -pipeline ) and the loop transform optimizations ( -transform_loops ). You can also specify -notransform_loops or -nopipeline with -o5 .
If you also specify the -wsf option to request parallel processing, you cannot use the -o5 option.
The -pipeline option activates the only software pipelining optimization (previously done only by -o5 ). The software pipelining optimization applies instruction scheduling to certain innermost loops, allowing instructions within a loop to "wrap around" and execute in a different iteration of the loop. This can reduce the impact of long-latency operations, resulting in faster loop execution.
Software pipelining also enables the prefetching of data to reduce the impact of cache misses. In certain cases, software pipelining improves run-time performance (separate timings are suggested).

The following -reentrancy keyword options specify the level of thread-safe reentrant run-time library support needed:

Option Name	Description
`-reentrancy none`	Informs the Compaq Fortran RTL that the program will not be relying on threaded or asynchronous reentrancy. Therefore the RTL need not guard against such interrupts inside the RTL. This is the default.
`-reentrancy asynch`	Informs the Compaq Fortran RTL that the program may contain asynchronous handlers that could call the RTL. Therefore the RTL will guard against asynchronous interrupts inside its own critical regions.
`-reentrancy threaded`	Informs the Compaq Fortran RTL that the program is multithreaded, such as those using the DECthreads library. Therefore the RTL will use thread locking to guard its own critical regions. To use the threaded libraries, also specify `-threads` .
`-noreentrancy`	The same as `-reentrancy none` .

The -s option generates a .s file, which can be assembled. This option is intended for systems running Compaq Tru64 UNIX (DIGITAL UNIX) Version 4.0 or later, which has certain new Assembler features.
Certain complex programs that use modules or common blocks compiled with -s may not generate code completely acceptable to the Assembler.
The -speculate keyword option supports the speculative execution optimization:
- Use -speculate all to perform the speculative execution optimization on all routines in the program. All exceptions within the entire program will be quietly dismissed without calling any user-mode signal handler.
- Use -speculate by_routine to perform the speculative execution optimization on all routines in the current compilation unit (set of routines being compiled), but speculative execution will not be performed for routines in other compilation units in the program.
- Use -speculate none or -nospeculate to suppress the speculative execution optimization. This is the default.
The speculative execution optimization reduces instruction latency stalls to improve run-time performance for certain programs or routines. This optimization evaluates conditional code (including exceptions) and moves instructions that would otherwise be executed conditionally to a position before the test, so they are executed unconditionally.
Speculative execution does not support some run-time error checking, since exception and signal processing (including SIGSEGV, SIGBUS, and SIGFPE) is conditional. When the program needs debugging or while testing for errors, use -speculate none (default).
Specifying -threads requests that the linker use threaded libraries. This is usually used with -reentrancy threaded .
The -transform_loops option supports a group of optimizations that improve the performance of the memory system and can apply to multiple nested loops. The loops chosen for loop transformation optimizations are always counted loops (counted loops include DO or IF loops, but not uncounted DO WHILE loops). In certain cases, loop transformation improves run-time performance (separate timings are suggested).
Specify -nowsf_main to indicate that the HPF global routine being compiled will be linked with a main program that was not compiled with -wsf .

For more information on f90 command options, see the Compaq Fortran User Manual for Tru64 UNIX and Linux Alpha Systems, Chapter 3, or f90(1).

In addition to the f90 command-line options, the following new or changed features were added for Version 4.0:

The random_number intrinsic (as of Version 4.0) uses two separate congruential generators together to produce a period of approximately 10**18, and produces real pseudorandom results with a uniform distribution in (0,1). It accepts two integer seeds, the first of which is reduced to the range [1, 2147483562]. The second seed is reduced to the range [1, 2147483398]. This means that the generator effectively uses two 31-bit seeds.
The new algorithm behaves differently from one provided prior to Version 4.0 in the following ways:
- Both seeds are active and contribute to the random number being produced.
- If the given seeds are not in the ranges given above, they will be reduced to be in those ranges.
- The sequences of random numbers produced by the new generator will be different from the sequences produced by the old generator.
For more information on the algorithm, see:
- Communications of the ACM vol 31 num 6 June 1988, entitled Efficient and Portable Combined Random Number Generators by Pierre L'ecuyer
- Springer-Verlag New York, N. Y. 2nd ed. 1987, entitled A Guide to Simulation by Bratley, P., Fox, B. L., and Schrage, L. E.
For an example program, see Section 1.12.4.
The implementation of the MATMUL intrinsic procedure was changed for this release. Previously the compiler called a routine in the scalar HPF library to perform the operation. As of this release, the compiler generates optimized inline code for the MATMUL intrinsic with a significant increase in the performance when the size of the array arguments are small.
To use previous implementation of the MATMUL intrinsic (routine in the scalar HPF library), specify -hpf_matmul .
The cDEC$ ALIAS directive
The cDEC$ ALIAS directive is now supported in the same manner as in Compaq Fortran 77. This directive provides the ability to specify that the external name of an external subprogram is different than the name by which it is referenced in the calling subprogram.
This feature can be useful when porting code between OpenVMS and UNIX systems where different routine naming conventions are in use.
For more information on the cDEC$ ALIAS directive, see the Compaq Fortran User Manual for Tru64 UNIX and Linux Alpha Systems.
The cDEC$ ATTRIBUTES directive
The cDEC$ ATTRIBUTES directive lets you specify properties for data objects and procedures. These properties let you specify how data is passed and the rules for invoking procedures. This directive is intended to simplify mixed language calls with Compaq Fortran routines written in C or Assembler.
For more information on the cDEC$ ATTRIBUTES directive, see Compaq Fortran User Manual for Tru64 UNIX and Linux Alpha Systems.
An additional math library allows use of optimizations for a series of square root calculations.
The library file libm_4sqrt ships on the DIGITAL Fortran 90 Version 4.0 kit (and DIGITAL Fortran 77 Version 4.0). These optimizations improve run-time performance when a series of square root calculations occur within a counted loop.
Enhanced support for the FORALL statement and construct
The FORALL construct now allows the following statements in the forall body:
- Pointer assignment statement
- FORALL statement or construct (nested FORALL)
- WHERE statement or construct
Please note that each statement in the FORALL body is executed completely before execution begins on the next FORALL body statement.
The compiler now correctly defines the scope of a FORALL subscript name to be the scope of the FORALL construct. That is, the subscript name is valid only within the scope of the FORALL. Its value is undefined on completion of the FORALL construct.
OPTIONS statement options can now be abbreviated (for compatibility with DIGITAL Fortran 77).
The -vms option now supports use of /LIST or /NOLIST in an INCLUDE statement (for compatibility with DIGITAL Fortran 77).
To improve run-time performance, new optimizations are now available and certain improvements have been made, including:
- Certain intrinsic procedures specific to Fortran 90 (not available in FORTRAN-77)
- Subprogram calls with array arguments
- New command-line options that activate new optimizations, including the loop transformation optimizations ( -transform_loops or -o5 ) and the speculative execution optimization ( -speculate keyword ). The software pipelining optimization is now activated by using -pipeline or -o5 .
Variable formats expressions (VFEs) are now allowed in quoted strings.
Invalid formats in quoted strings are now detected at compile-time rather than run-time.

For more information on compatibility with DIGITAL Fortran 77, see the revised Compaq Fortran User Manual for Tru64 UNIX and Linux Alpha Systems, Appendix A.

2.3 New Features in Version 2.0

New features for Version 2.0 include the LOC intrinsic function. LOC returns the internal address of its argument (same as the built-in function %LOC).

In addition, the Compaq Ladebug debugger has added support for Compaq Fortran language features (see Section 1.12.8.1).

2.4 New Features in Version 1.3

New features for Version 1.3 include the f90 command options that support the Compaq Parallel Software Environment.

To request parallel execution, specify the -wsf or -wsf nn option . This compiles the program to run in parallel using the Compaq Parallel Software Environment product. The optional nn parameter specifies the number of processors on which the program is intended to execute. If not specified, the program will be compiled to execute on any number of processors. More efficient code is generated when nn is specified.

If you specify the -wsf or -wsf nn option to request parallel execution, you can also use the following related options:

The -assume nozsize option assumes there are no zero-sized array sections.
The -nearest_neighbor or -nearest_neighbor nn option enables or disables the nearest neighbor parallel optimization. The optional nn parameter specifies the width of the shadow edge to use. If you omit nn, it is set to 1.
The -pprof string option allows parallel profiling of an application. Valid characters for string are s for sampling or i for interval. This option must be used with the -non_shared option (as well as -wsf or -wsf nn ). This option must not be used with the -p1 option.
The -show wsfinfo option includes information about statements which cause interprocessor communication to be generated or are serialized in the listing file.

Other Version 1.3 new features include the following:

Support for the DIGITAL Fortran 77 pointers (CRAY® style). This is an extension to the Fortran 95/90 and FORTRAN-77 standards. For more information, see the DEC Fortran Language Reference Manual and Section 1.12.5.
Bit constants with a trailing B or Z or leading X (a Compaq Fortran extension) are now supported for compatibility with Compaq Fortran 77:
i = '001'B k = '0ff'Z j = X'00f'
The SYSTEM_CLOCK intrinsic procedure has been extended to allow integer arguments of any KIND rather than the default integer KIND . This allows the use of INTEGER*8 arguments to obtain a higher degree of magnitude and accuracy in timings (1,000,000 counts per second). For example:
integer*8 count,count_max,count_rate call system_clock(count,count_rate,count_max)
When it is passed an INTEGER (KIND=4) value, the SYSTEM_CLOCK intrinsic procedure now returns a value in terms of 10,000 instead of 1,000,000 counts per second.
Debugging support has been enhanced to allow breakpoints on CONTINUE, GOTO, and RETURN statements. Before Version 1.3, breakpoints could not be set on a CONTINUE statement and only on certain GOTO and RETURN statements.
The following DIGITAL Fortran 90 cDEC$ directives are now supported:
- cDEC$ IDENT specifies a string that identifies the object file.
- cDEC$ OPTIONS and cDEC$ END_OPTIONS controls alignment of fields in common blocks, record structures, and most derived-type structures.
- cDEC$ PSECT modifies certain attributes of a common block, including the [NO]MULTILANGUAGE attribute for compatibility with DIGITAL Fortran 77.
- cDEC$ TITLE and cDEC$ SUBTITLE specifies strings for the title and subtitle of a listing file header.
Any number raised to a floating point 2.0 (x ** 2.0) is now transformed to (x ** 2) for compatibility with DIGITAL Fortran 77.
The Bessel function 3f library (jacket) routines are now supported (see bessel(3f))
The following f90 command options were added for Version 1.3:
- The -fuse_xref option requests that DIGITAL Fortran 90 generate a data file that the Compaq FUSE Database Manager uses to create a cross-reference database file. This improves the performance of the Compaq FUSE Call Graph Browser and Cross-Referencer that use the database file for their operations.
- The -inline speed and -inline size options have been added in place of -inline automatic to provide more control over procedure inlining:
  Use -inline size (same as -inline space ) to inline procedures that will likely improve run-time performance where inlining will not significantly increase program size. This option is meaningful only at optimization levels -o1 and higher.
  Use -inline speed to inline procedures that will likely improve run-time performance where inlining may significantly increase program size. Using -inline speed often results in larger executable program sizes (than -inline size ). This type of inlining occurs automatically with the -o4 or -o5 optimization levels. This option is meaningful only at optimization levels -o1 and higher.
  Other -inline xxxx options include -inline none , -inline manual , and -inline all (see Section 2.5).
- The -ladebug option includes additional symbolic information in the object file for the DIGITAL Ladebug debugger (see ladebug(1). This option enables Ladebug to print and assign to dynamic arrays using standard Fortran syntax, including array sections.
- The -show map option includes a symbol map in the listing file (also specify -v ).
- The -version option displays DIGITAL Fortran 90 version number information.

For more complete product information, see the Compaq Fortran documentation and the f90(1) reference (man) page.