Operator Messages Manual

Chapter 77 PAM Messages

The messages in this chapter are generated by the PAM subsystem. The subsystem ID displayed by these messages includes PAM as the subsystem name.

With this release, the numbers for the event messages have changed. The PAMMAN events, which used to be numbered 15 through 19, are now numbered 101 through 106 (106 is new). The PAM events, which used to be numbered 1 through 14, are now numbered 201 through 216 (215 and 216 are new). Event messages 1 through 106 are generated by PAMMAN.
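
For scripts that still carry the old event numbers, the renumbering might be handled with a small lookup such as the following Python sketch. This chapter gives only the old and new ranges, so the order-preserving mapping (old PAMMAN 15 becomes 101, old PAM 1 becomes 201) and the function name renumber_event are assumptions, not documented behavior.

```python
def renumber_event(source: str, old_number: int) -> int:
    """Map a pre-release event number to its assumed new number.

    Assumption: the renumbering keeps the original order within each
    range. The manual states only the old and new ranges.
    """
    if source == "PAMMAN" and 15 <= old_number <= 19:
        return old_number - 15 + 101   # 15..19 -> 101..105 (106 is new)
    if source == "PAM" and 1 <= old_number <= 14:
        return old_number - 1 + 201    # 1..14 -> 201..214 (215, 216 are new)
    raise ValueError(f"no renumbering known for {source} event {old_number}")
```

The source name is required because the old PAM numbers (1 through 14) overlap the numbers of PAMMAN events that still exist.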

PAMMAN Event Messages



1

000001 PAM Manager internal Error

Cause   PAMMAN encountered an internal error.

Effect  The PAMMAN primary process might change to the backup process.

Recovery  This is a serious error. Contact your service provider.



2

000002 PAM Manager internal trap

Cause  PAMMAN encountered an internal error.

Effect  The PAMMAN primary process might change to the backup process.

Recovery  This is a serious error. Contact your service provider.



3

000003 PAM Manager backup is up

Cause  The PAMMAN backup process started.

Effect  PAMMAN is running as a NonStop process pair.

Recovery  None. This is an informational event.



4

000004 PAM Manager backup is down

Cause  The PAMMAN primary process could not start the backup process. The processor in which PAMMAN is attempting to start the backup process might not be started.

Effect  PAMMAN is not running as a NonStop process pair.

Recovery  Start the processor in which PAMMAN is attempting to start the backup process.



5

000005 PAM Manager create failure

Cause  PAMMAN could not create a backup process although the processor in which it is attempting to start the process is up.

Effect  PAMMAN does not run as a NonStop process pair.

Recovery  Switch the backup process to a different processor. See the SCF Reference Manual for the Kernel Subsystem for information about how to switch a manager process to another processor.



6

000006 PAM Manager checkpoint failure

Cause  The PAMMAN primary process was unable to create a checkpoint with the backup process.

Effect  The PAMMAN backup process does not have current information.

Recovery  If this error persists, abort and then restart the PAMMAN process. See the SCF Reference Manual for the Kernel Subsystem for information on aborting and starting a manager process.



7

000007 PAM Manager process create failure

Cause  PAMMAN could not create a PAM Monitor (PAMMON) process.

Effect  A PAMMON process in one of the processors is not running.

Recovery  Abort and then restart PAMMAN. See the SCF Reference Manual for the Kernel Subsystem for information on aborting and starting a manager process.



101

Add Object into NameSpace failed Object object-name Error Status: error-status

object-name

is the name of the object reporting this event.

error-status

is the internal error code.

Cause  PAMMAN is unable to add an object into its name-space, because:

  • The object is already defined in the name-space.

  • The supervisor object is not found in the name-space.

  • There is not enough memory to proceed with this operation.

Effect  The object is not added. The operation continues, but there is a possible name‑space inconsistency between PAM and PAMMAN.

Recovery  Stop and restart PAMMAN. If the problem persists, contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



102

Delete Object from NameSpace failed Object object-name Error Status: error-status

object-name

is the name of the object reporting this event.

error-status

is the internal error code.

Cause  PAMMAN is unable to delete an object from its name-space because the name of the object is not found.

Effect  The object is not deleted. The operation continues, but there is a possible name‑space inconsistency between PAM and PAMMAN.

Recovery  Stop and restart PAMMAN. If the problem persists, contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



103

Bad File Name from PAMMAN_IN_QUEUE File Name: file-name Error Status: error-status

file-name

is the name of the object reporting this event.

error-status

is a file-system error number indicating the outcome of the file name conversion operation.

Cause  The message PAMMAN received through QIO Global Queue contains an object name with a bad file-system naming format.

Effect  The message is discarded. The operation continues, but there is a possible name-space inconsistency between PAM and PAMMAN.

Recovery  Stop and restart PAMMAN. If the problem persists, contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



104

QIO MD with bad read/write pointer Read Ptr: read-ptr Write Ptr: write-ptr

read-ptr

is the read pointer of the message descriptor (MD) PAMMAN received through QIO.

write-ptr

is the write pointer of the MD PAMMAN received through QIO.

Cause  PAMMAN received a bad MD while listening to the QIO Global Queue.

Effect  Possible name-space inconsistency between PAM and PAMMAN.

Recovery  Stop and restart PAMMAN. If the problem persists, contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



105

Call to QIOLIB function failed Function Name function-name Error Status: error-status

function-name

is the name of the QIO library function that is called by PAMMAN and returns the error.

error-status

is the error code returned by the QIO library function named above.

Cause  PAMMAN detected an error while performing a QIO library operation.

Effect  This is a serious internal error. Depending on the QIO function performed, PAMMAN may or may not be able to continue operation.

Recovery  Contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



106

FILE_OPEN_ to PAM process failed File Name: filename Returned File No:file-no FS Error Code: err-code

filename

is the name of the PAM process that PAMMAN is trying to open.

file-no

is the file number returned by the FILE_OPEN_ call.

err-code

is the error returned by FILE_OPEN_().

Cause  The PAM process is not responding properly to the FILE_OPEN_ call.

Effect  PAMMAN cannot open the PAM process. No SCF requests can be forwarded to that PAM process.

Recovery   Stop and restart the PAMMAN process. If this error persists, contact your service provider and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.

PAM Event Messages



201

count Inbound Ag SDU Not Delivered Over Last 3 Min. Port Name: filename SDN: sdn Cause: #cause

count

is the number of inbound aggregate SDUs not delivered in the last three minutes.

filename

is the name of the port.

sdn

is the subdevice number.

cause

specifies why this error occurred. The value of cause can be:

6   The application had no READs posted.
7   The aggregate SDU contained a bad address.

Cause  An inbound aggregate SDU was delivered to the port module but could not be delivered to a client (user of a port module).

Effect  The inbound aggregate SDU is discarded, and the error is recorded in the port’s statistics.

Recovery  If the aggregate SDU could not be delivered to the application (cause 6), the subsystem is delivering aggregate SDUs faster than the client can post file-system reads. The client should increase the number of SDUs in each inbound aggregate SDU by means of the data forward count (DFC) attribute of the ALTER PORT command, which slows the rate at which aggregate SDUs are delivered to the application.

If the aggregate SDU contained a bad address (cause 7), the adapter has delivered an aggregate SDU to an unknown port. Data might be lost, and the client might have to perform higher-level protocol recovery. Contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.
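
When event 201 text is collected from a log, the fields described above might be extracted as in the following Python sketch. The exact rendering of the message on a given system is an assumption (in particular, whether the # before the cause value is displayed), and the port name in the usage note is hypothetical. The names PATTERN_201, CAUSES_201, and parse_event_201 are illustrative only.

```python
import re

# Cause values as listed for event 201 in this chapter.
CAUSES_201 = {
    6: "The application had no READs posted.",
    7: "The aggregate SDU contained a bad address.",
}

# Assumed rendering of the message template; the '#' is treated as optional.
PATTERN_201 = re.compile(
    r"(?P<count>\d+) Inbound Ag SDU Not Delivered Over Last 3 Min\. "
    r"Port Name: (?P<port>\S+) SDN: (?P<sdn>\d+) Cause: #?(?P<cause>\d+)"
)

def parse_event_201(text: str) -> dict:
    """Return the fields of an event 201 message as a dictionary."""
    m = PATTERN_201.search(text)
    if m is None:
        raise ValueError("text does not look like event 201")
    cause = int(m.group("cause"))
    return {
        "count": int(m.group("count")),
        "port": m.group("port"),
        "sdn": int(m.group("sdn")),
        "cause": cause,
        "cause_text": CAUSES_201.get(cause, "unknown cause"),
    }
```

For example, parse_event_201("12 Inbound Ag SDU Not Delivered Over Last 3 Min. Port Name: $PAM1.#P1 SDN: 3 Cause: 6") yields a count of 12 and the cause text for value 6.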



202

Bad Outbound Ag SDU Port Name: filename Cause: #cause

filename

is the name of the port.

cause

specifies why this error occurred. The value of cause can be:

6   The aggregate service data unit (SDU) could not be read.
7   The aggregate SDU contained a bad address.

Cause  The client has delivered an aggregate SDU to the subsystem that does not conform to the rules governing the format of these messages.

Effect  The adapter discards all remaining frames in the erroneous aggregate SDU message. SDUs previously taken from the message are not affected and are delivered to the media.

Recovery  The client should isolate the bad message, determine where it deviates from the proper format, and fix the offending code. If the client is using the ZL2S formatting library, contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



203

Checkpoint Error Entity: rept-entity Chkpt Type: chkpt-type Cause: #cause

rept-entity

is the PAM module that reports the checkpoint error.

chkpt-type

is the type of checkpoint (such as CHKPT_SUBDEV_ADD, CHKPT_OPEN) that was in error.

cause

specifies why this error occurred. The value of cause can be:

1   The backup process received a subdevice number that is not configured.
2   The backup process received checkpoint data of the wrong size.
3   READLINK failed when reading checkpoint data.
4   The backup could not obtain memory for checkpointing additional control blocks.
5   The backup received a checkpoint with an unexpected type.

Cause  The backup process cannot process a checkpoint.

Effect  No immediate problems result, but a switch can result in unpredictable actions by PAM.

Recovery  Halt and dump the processor of the PAM backup process, then reload.

If the problem persists, do not reload the backup; contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



204

filename switched from cpufrom to cputo due to LIF lifname

filename

is the name of the PAM process that switched.

cpufrom

is the processor from which the PAM process switched.

cputo

is the processor to which the PAM process switched.

lifname

is the name of the LIF that this PAM process is using.

Cause  The LIF associated with the PAM process either switched ownership from the current primary or it no longer has access on this processor.

Effect  PAM waits for the LIF to come up on the new processor and registers the ports. The application data transfer resumes. If the LIF switches processors again, PAM follows the LIF.

Recovery  If the PAM process keeps switching or does not recover, check the status of the associated LIF and PIF by using PFPFMON commands and change the access of the LIF to the processor you want. Bring up the LIF and PIF if they are down.
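
To spot a PAM process that keeps switching, event 204 text from a log might be counted per process, as in the following Python sketch. The threshold of three switches, the assumption that processors are displayed as plain numbers, and the names SWITCH_204 and flapping_processes are illustrative choices, not values taken from this manual.

```python
import re
from collections import Counter

# Assumed rendering of event 204; processor numbers are taken to be digits.
SWITCH_204 = re.compile(
    r"(?P<process>\S+) switched from (?P<cpu_from>\d+) to (?P<cpu_to>\d+) "
    r"due to LIF (?P<lif>\S+)"
)

def flapping_processes(event_texts, threshold=3):
    """Return sorted (process, LIF) pairs seen in at least `threshold` switches."""
    counts = Counter()
    for text in event_texts:
        m = SWITCH_204.search(text)
        if m:
            counts[(m.group("process"), m.group("lif"))] += 1
    return sorted(key for key, n in counts.items() if n >= threshold)
```

Any pair this function returns is a candidate for the LIF and PIF status check described in the recovery text.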



205

filename Device Down

filename

is the name of the PAM process that went down.

Cause  The operator stopped the line with a Subsystem Control Facility (SCF) command, or the line stopped itself because of a serious error.

Effect  The line is stopped.

Recovery  If the error was not forced by an ABORT or PFSTOP command, contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



206

filename Device Up

filename

is the name of the PAM process that came up.

Cause  The operator successfully returned the device to the system through a Subsystem Control Facility (SCF) START command.

Effect  Processing continues normally.

Recovery  Informational message only; no corrective action is needed.



207

Process Internal Error: file-sys-err Module: module-name Line: line-num

file-sys-err

identifies a file-system error.

module-name

is the name of the PAM module that detected the internal error.

line-num

is the line number in the module where the internal error was detected.

Cause  PAM has detected a serious internal error. This error is generated at the point PAM detected the error.

Effect  Subsequent requests to the line are rejected with file-system error 157.

Recovery  Halt and dump the processor of the reporting process, then reload. Contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



208

IO Process Abended Module: module-name Line: line-num Error: #err-code Reason: #reason

module-name

is the name of the PAM module that detected the internal error.

line-num

is the line number in the module where the internal error was detected.

reason

indicates why this error occurred. The value of reason can be:

2   Too few start-up arguments.
3   Invalid start-up argument.
4   Specified backup processor invalid.
5   Invalid LIF name specified.
6   Invalid maximum recsize specified.
13  PAM process got into an unexpected function. Take dump and call your service provider.
14  PAM process got an unexpected error from an NSK function call. Take dump and call your service provider.
15  PAM process got an unexpected internal error. Take dump and call your service provider.
16  PAM process got an unexpected error from the LIF. This PAM process abends. Check the status of the LIF and associated PIF/SAC in the SLSA subsystem, correct the SLSA problem, and restart the PAM process.
17  PAM process does not have potential access to LIF from either the primary or backup processor. Check the status of the LIF and associated PIF/SAC in the SLSA subsystem. You either need to bring up PAM on the processors that have access to the LIF or change the SLSA configuration so that the processors in which you intend to bring up PAM have potential access.
18  PAM process does not have data access to LIF from either the primary or backup processor. Check the status of the LIF and associated PIF/SAC in the SLSA subsystem. You either need to bring up PAM on the processors that have access to the LIF or change the SLSA configuration so that the processors in which you intend to bring up PAM have data access.

Cause  The PAM process detected an error at process initialization time and is unable to go on.

Effect  PAM terminates abnormally.

Recovery  Restart PAM with the correct values. If it still fails, contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.
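
The reason values for event 208 fall into three groups, and a first-pass triage might look like the following Python sketch. The grouping and the summarized actions are condensed from the reason descriptions above; the function name triage_208 and the wording of the returned strings are illustrative assumptions.

```python
STARTUP_REASONS = {2, 3, 4, 5, 6}      # bad or missing start-up values
INTERNAL_REASONS = {13, 14, 15}        # unexpected internal conditions
LIF_REASONS = {16, 17, 18}             # LIF error or missing LIF access

def triage_208(reason: int) -> str:
    """Return a one-line summary of the first recovery step for a reason value."""
    if reason in STARTUP_REASONS:
        return "restart PAM with corrected start-up values"
    if reason in INTERNAL_REASONS:
        return "take a dump and call your service provider"
    if reason in LIF_REASONS:
        return "check the LIF and PIF/SAC status in SLSA, then restart PAM"
    return "unlisted reason: contact your support representative"
```

The summary is a starting point only; the full reason description still governs the recovery.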



209

Port Configuration Error Port Name: filename SDN: #sdn Error: #port-config-err Parm: #port-config-err-param

filename

is the name of the port.

sdn

is the subdevice number.

port-config-err

is the port configuration error encountered while starting the port.

port-config-err-param

is the port configuration parameter that was in error while starting the port.

Cause  A problem was encountered during communication with the adapter during the start/stop port phase.

Effect  The port stays in the stopped state.

Recovery  Recovery depends on the type of problem. If this message is followed by the message “Port Configuration Failed” (event 210), the port has not entered the started state. Issue the START command again.

If the problem persists, contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



210

Port Configuration Failed Port Name: filename SDN: #sdn

filename

is the name of the port.

sdn

is the subdevice number.

Cause  This message is generated if PAM could not configure a port. This error should be preceded by a port configuration error.

Effect  The port stays in the stopped state.

Recovery  Recovery depends on the type of problem. Issue the START command again.

If the problem persists, contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.
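
Because a Port Configuration Failed event is expected to follow a Port Configuration Error event for the same port, the two might be correlated as in the following Python sketch. The input form, a sequence of (event number, port name) pairs in arrival order, and the function name ports_needing_restart are assumptions made for illustration.

```python
def ports_needing_restart(events):
    """Return ports for which event 209 was later followed by event 210.

    `events` is an iterable of (event_number, port_name) pairs in
    arrival order. The result keeps the order in which ports failed.
    """
    pending = set()   # ports with a 209 not yet matched by a 210
    failed = []
    for number, port in events:
        if number == 209:
            pending.add(port)
        elif number == 210 and port in pending:
            pending.discard(port)
            if port not in failed:
                failed.append(port)
    return failed
```

Each port returned is one for which the START command should be issued again.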



211

State Machine Error SM ID: #level-err State: %state Event: #evt

level-err

indicates which module is reporting a state-machine error.

state

is the state machine’s current state at the time of an error.

evt

is the state-machine event number that was in error.

Cause  PAM makes extensive use of state machines. This message is generated if the PAM process detects an internal state-machine error.

Effect  The subsystem’s action depends on the entity that reported the error. If one of the PORT SAPs or the other entities reports the problem, that entity is put in the STOPPED state but the rest of the subsystem functions normally.

Recovery  Reconfigure the object and continue operation.

Contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



212

UnExpected Controller Response Module: module-name Line: line-num DSAP: #ctlr-dsap SSAP: #ctlr-ssap Func: #ctlr-func Mod: #ctlr-mod

module-name

is the name of the PAM module that detected the internal error.

line-num

is the line number in the module where the internal error was detected.

ctlr-dsap

is a protocol packet that indicates the destination service access point (DSAP).

ctlr-ssap

is a protocol packet that indicates the source service access point (SSAP).

ctlr-func

is the function field in the controller response.

ctlr-mod

is the mod field in the controller response.

Cause  An unexpected line operation (LIOP) response was received by the I/O process.

Effect  The LIOP frame is discarded. The operation continues; however, requests might be lost as the result of this error.

Recovery  Contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



213

notif-type notification not delivered Port Name: port-name Cause: not-notif-cause Send Complete Error: send-err MSGSYS Error: msg-sys-err

notif-type

is the type of SNATR notification that could not be delivered. The possible values are: Received-AggSDY, LS_Connected, Flow_Control, and Status_Change_Notice.

port-name

is the name of the object that is reporting this event.

not-notif-cause

indicates the reason why the SNATR notification could not be delivered. The possible values of not-notif-cause are:

Link_Failed

indicates that the SNAX line-handler process or path to the SNAX line-handler process is down.

Window_Exceeded

indicates that the Transmit Window to SNAX is full.

SAP_Closed

indicates that the SAP (identified by the port name) has already been closed by SNAX.

Out_of_Memory

indicates that the PAM process ran out of memory, possibly because of heavy traffic.

send-err

is the send completion error received by the PAM process when it attempted to send this notification.

msg-sys-err

is the message system error received by the PAM process when it attempted to send this notification.

Cause  The cause of this error is described in not-notif-cause.

Recovery  Loss of notifications to the SNAX line-handler process generally results in SNAX timing out and reestablishing the session. When a Link_Failed or a Window_Exceeded error is received, SNAX should reopen the SAP and reestablish the session.

If the Out_of_Memory or Window_Exceeded error is temporary, subsequent notifications will be delivered and no action is necessary. However, if these errors are not temporary, you should contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.
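
The recovery guidance for event 213 depends on the value of not-notif-cause and on whether the condition is temporary. It might be summarized as in the following Python sketch. The function name action_for_213, the persistent flag, and the wording of the returned strings are illustrative assumptions; no recovery is listed above for SAP_Closed, so the sketch reports that rather than inventing one.

```python
def action_for_213(cause: str, persistent: bool = False) -> list:
    """Return the recovery notes that apply to a not-notif-cause value."""
    actions = []
    if cause in ("Link_Failed", "Window_Exceeded"):
        actions.append("SNAX should reopen the SAP and reestablish the session")
    if cause in ("Out_of_Memory", "Window_Exceeded"):
        if persistent:
            actions.append("contact your support representative")
        else:
            actions.append("no action needed if later notifications are delivered")
    if not actions:
        actions.append("no specific recovery is listed for this cause")
    return actions
```

Window_Exceeded appears in both groups because the recovery text mentions it in both paragraphs.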



214

req-type request not sent Port Name: fname Cause: not-sent-cause

req-type

is the type of request from SNAX that could not be sent down to the Token_Ring ServerNet adapter. The possible values of req-type are: Send_AggSdu, Open_Sap, Open_Ls, Connect_LS, Flow_Control, Close_Ls, and Close_Sap.

fname

is the name of the object that is reporting this event.

not-sent-cause

is the reason why the request could not be sent. The possible values of not-sent-cause are:

Read Link Failed

indicates that the data associated with the request could not be read.

Lif Failed

indicates that the LIF went down. The PAM process will close the port for this SAP.

Cause  The cause of this error is described in not-sent-cause.

Recovery  If the cause of the error is Lif Failed, SNAX is able to reestablish the session when the LIF comes up; no action is necessary. If the cause of the error is Read Link Failed, contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



215

Backup could not be started. Name: name Error: #err-num Error Detail: err-detail

name

is the name of the process for which the backup could not be started.

err-num

is the error number that indicates why the backup did not start.

err-detail

provides detailed information about the error.

Cause  The PAM backup process cannot start.

Effect  The backup PAM process is not created. The primary process continues to run and tries to restart the backup until the backup process starts.

Recovery  If the problem persists, contact your support representative and provide all relevant information as follows:

  • Descriptions of the problem and accompanying symptoms

  • Details from the message or messages generated

  • Supporting documentation such as Event Management Service (EMS) logs, trace files, and a processor dump, if applicable

If your local operating procedures require contacting the Global Mission Critical Solution Center (GMCSC), supply your system number and the numbers and versions of all related products as well.



216

notif-name notif-func error notif-err. Info: (notif-info1, notif-info2). Module: mod-name Line: line-num

notif-name

is the entity (such as a LIF) in PAM associated with the error.

notif-func

is the name of the function that failed.

notif-err

is the error encountered in the function.

notif-info (1 and 2)

provide context pertaining to the error.

mod-name

is the name of the PAM module reporting the error notification.

line-num

is the line number in the module reporting the error notification.

Cause  The cause depends on the notif-func and notif-err.

Effect  PAM continues to run but the error could affect data transfer.

Recovery  If the PAM process does not recover or the data transfer does not continue, use SCF to check and log the STATUS, INFO, and STATISTICS of the PAM line and its subordinate objects, as well as the STATUS of the associated LIF, PIF, and SAC objects.

If the LIF and PIF are STOPPED, or PAM's primary and backup processors are not in the LIF access list, determine why the access changed and alter the access list to include PAM's primary and backup processors, or restart PAM on the processors that have access to the LIF.

If the LIF and the LIF access are OK, take an online processor dump of the primary and backup processors, then abort and restart the PAM process. If you still have problems, contact your service provider with the processor dumps and the SCF and EMS logs.