Skip to content

Rucio Traces

Paul Nilsson edited this page Apr 25, 2022 · 1 revision

Note: this page is VO specific / users of Rucio

The Pilot sends detailed information about file transfers to Rucio. Here is a list of the different fields contained in the trace report.

Field name Type Meaning
appid String PanDA job id
catStart Float Time stamp when Rucio has been queried for replica information
clientState String State at the time the trace is sent; e.g. INIT_REPORT, STAGEIN_FAILED, NO_REPLICA, STAGEIN_NOTALLOWED, STAGEIN_ATTEMPT_FAILED, STAGEOUT_ATTEMPT_FAILED, DONE
dataset String Dataset name (prodDBlock from job definition - or destinationDblock)
duid   Currently not set by the Pilot
eventType String

Type of trace; currently the following event types are sent by the pilot (ordered by frequency; in the case of user jobs, an _a is added to the type):

  • get_sm_a: stage-in inputs for user analysis jobs
  • get_sm: stage-in inputs for non-analysis jobs
  • put_sm: stage-out outputs for non-analysis jobs
  • get_es: stage-in inputs for eventservice jobs
  • download: EXPLANATION MISSING
  • put_sm_logs: stage-out logs for non-analysis jobs
  • put_sm_a: stage-out outputs for analysis jobs
  • put_sm_logs_a: stage-out logs for analysis jobs
  • upload: EXPLANATION MISSING
  • put_es: stage-out outputs for eventservice jobs
  • put_sm_logs_os: stage-out logs to objectstore
eventVersion String Pilot version
filename String The local file name (LFN)
filesize Integer The file size
guid String A unique file identifier. The guid sent with the traces has the '-' signs removed
hostname String Host name as returned by socket.gethostbyaddr(socket.gethostname())[0]
ip String IP number of the local host
localSite String Local site name as defined by the RUCIO_LOCAL_SITE_ID environmental variable. If this is not set, then use the DDM endpoint
pq String PanDA site name
protocol String Name of copy tool used by the Pilot; e.g. xrdcp
relativeStart Float Transfer start time. Currently the same time stamp as transferStart
remoteSite String DDM endpoint
scope String Replica scope
stateReason String Error message or explanation; e.g. BAD_COPYTOOL, OK, 'skip stagein file'
suspicious String Currently not being reported by the pilot (always set to '0')
taskid String PanDA task ID
timeEnd String Time stamp when transfer or replica lookup, etc has been finished/failed by the pilot
timeStart Float Start time of the trace report
transferStart Float Transfer start time. Currently the same time stamp as relativeStart
url String TURL in case of direct access for a given replica
usr String The hash of the user DN (usrdn field); hashlib.md5(job.prodUserID).hexdigest()
usrdn String The distinguished name of the user (DN)
uuid String The hash of the JobDefinitionID (from the job definition)
validateStart Float Time stamp when the Pilot performs checksum verification during stage-in and stage-out
version String Currently not used by the Pilot