Accessing Spark logs
Access to Spark logs is available for monitoring and troubleshooting Spark jobs. This page provides instructions for accessing Spark logs using the YARN ResourceManager UI, the yarn logs command, and the JobControl dashboard. The YARN ResourceManager UI and the yarn logs command can only be used from the User Virtual Machine, while the JobControl dashboard is available outside the User Virtual Machine.
YARN ResourceManager UI
The YARN ResourceManager UI provides access to the Spark UI, which contains detailed information about the Spark job, including the DAG visualization, job statistics, and logs.
Access the Spark UI by clicking the ApplicationMaster link in the YARN ResourceManager UI. To access the YARN ResourceManager UI, visit https://epod-master1.vgt.vito.be:8090:
- Use a Firefox web browser from the User Virtual Machine.
- Have a valid Ticket Granting Ticket (TGT); instructions for creating a TGT can be found in Advanced Kerberos.
- Once in the UI, select an application ID to view the details of an application and click on the ApplicationMaster link to get to the Spark UI.
An improved UI is available at https://epod-master1.vgt.vito.be:8090/ui2 or https://epod-master2.vgt.vito.be:8090/ui2, depending on the current master node. This new UI offers an enhanced view of the resources used per queue, among other features.
YARN logs command
The YARN logs command can retrieve logs for a specific application ID. The application ID can be found in the YARN ResourceManager UI.
To get the logs, use the yarn logs command:
yarn logs -applicationId <application_id>
You can also pipe the output to less
to enable scrolling:
yarn logs -applicationId <application_id> | less
The yarn logs command will output logs for different log types:
stdout
: application logs printed tostdout
stderr
: application logs printed tostderr
directory.info
: prints the content of the container working directoryprelaunch.out
,prelaunch.err
,launch_container.sh
: logs related to container start
Filter log types by adding the -log_files_pattern
parameter, for example:
yarn logs -applicationId <application_id> -log_files_pattern stderr | less
JobControl dashboard
The JobControl dashboard (https://proba-v-mep.esa.int/applications/jobdashboard/) provides a web-based UI to access job logs. It can be accessed outside the User Virtual Machine without needing a Kerberos TGT. This dashboard simplifies job monitoring and log retrieval by offering log access for active and finished jobs through a single interface.
In the application list, it is possible to:
- Retrieve logs of both running and finished applications.
- Kill a running application (if the user submitted the job or if admin rights are available).
When retrieving logs, a dialogue will appear showing the job logs. The used executor nodes and log types (stdout
, stderr
, etc.) are shown on the left. Selecting an entry on the left will update the log entries displayed on the right.
By default, logs from all executors are shown. Filtering by executor nodes is possible by typing a string in the “Search text” field, e.g., typing “error” to filter logs containing the word “error.”
Within an executor log, it is possible to search for and navigate between occurrences of specific text. The occurrences will be highlighted:
YARN Spark Access Control Lists (ACLs)
To allow other users or user groups to view Spark job logs they didn’t submit themselves (e.g. when running the Spark job as a service user), add the following configuration parameters to spark-submit
:
These parameters set permissions for viewing logs of running and finished jobs, as well as modifying (including killing) the submitted job.
--conf spark.ui.view.acls=user1,user2 \
--conf spark.ui.view.acls.groups=group1 \
--conf spark.modify.acls=user1,user2
--conf spark.modify.acls.groups=group1