update ATspectrograph failed doc and created ATcamera-recovery doc
1 parent
b6a5671
commit a169ff8
Showing 6 changed files with 221 additions and 57 deletions.
160 changes: 160 additions & 0 deletions
AuxTel/AuxTel-Troubleshooting/LATISS-Troubleshooting/ATcamera-recovery.rst
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,160 @@ | ||
.. This is a template for troubleshooting when some part of the observatory enters an abnormal state. This comment may be deleted when the template is copied to the destination. | ||
.. Review the README in this procedure's directory on instructions to contribute. | ||
.. Static objects, such as figures, should be stored in the _static directory. Review the _static/README in this procedure's directory on instructions to contribute. | ||
.. Do not remove the comments that describe each section. They are included to provide guidance to contributors. | ||
.. Do not remove other content provided in the templates, such as a section. Instead, comment out the content and include comments to explain the situation. For example: | ||
- If a section within the template is not needed, comment out the section title and label reference. Include a comment explaining why this is not required. | ||
- If a file cannot include a title (surrounded by hash signs (#)), comment out the title from the template and include a comment explaining why this is implemented (in addition to applying the ``title`` directive). | ||
.. Include one Primary Author and list of Contributors (comma separated) between the asterisks (*): | ||
.. |author| replace:: *Tony Johnson* | ||
.. If there are no contributors, write "none" between the asterisks. Do not remove the substitution. | ||
.. |contributors| replace:: *Erik Dennihys* | ||
|
||
.. This is the label that can be used for cross-referencing this procedure. | ||
.. Recommended format is "Directory Name"-"Title Name" -- Spaces should be replaced by hyphens. | ||
.. _Templates-Title-of-Troubleshooting-Procedure: | ||
.. Each section should include a label for cross-referencing to a given area. | ||
.. Recommended format for all labels is "Title Name"-"Section Name" -- Spaces should be replaced by hyphens. | ||
.. To reference a label that isn't associated with a reST object such as a title or figure, you must give the link an explicit title using the syntax :ref:`link text <label-name>`. | ||
.. An error will alert you of identical labels during the build process. | ||
######################### | ||
AT camera recovery | ||
######################### | ||
|
||
|
||
.. _Title-of-Troubleshooting-Procedure-Overview: | ||
|
||
Overview | ||
======== | ||
|
||
.. In one or two sentences, explain when this troubleshooting procedure needs to be used. Describe the symptoms that the user sees to use this procedure. | ||
The camera is designed to go into the ``FAULT`` state whenever a limit (temperature, voltage, current, etc.) | ||
goes out of tolerance (limits typically have a warning range before a hard error occurs), | ||
or when an unexpected failure occurs during camera operation. Once the camera goes into the ``FAULT`` state, | ||
it is necessary to diagnose the problem, fix it, and then put the camera back into the ``ENABLED`` state | ||
before operations can resume. This document describes the general procedure for doing this, | ||
and will document any known common failure modes. | ||
|
||
This article was prompted by `OBS-97`_ (the LATISS camera got a timeout from REB IN PROGRESS on 28 February 2023), | ||
but it applies more generally than that specific incident. | ||
|
||
.. _OBS-97: https://rubinobs.atlassian.net/browse/OBS-97 | ||
|
||
|
||
.. Following note was below in the original page https://confluence.lsstcorp.org/display/OOD/ATCamera+Recovering+from+Fault+state | ||
.. note:: | ||
The instructions below assume: | ||
#. The ability to log in to the AuxTel CCS computers, | ||
#. Some familiarity with basic CCS commands/functionality. | ||
|
||
We need a separate document to provide this background information since it will need to be referred to from multiple places. | ||
|
||
|
||
.. _Title-of-Troubleshooting-Procedure-Error-Diagnosis: | ||
|
||
Error diagnosis | ||
=============== | ||
|
||
.. This section should provide simple overview of known or suspected causes for the error. | ||
.. It is preferred to include them as a bulleted or enumerated list. | ||
.. Post screenshots of the error state or relevant tracebacks. | ||
.. Added error diagnosis | ||
- ATCamera goes into the ``FAULT`` state. | ||
|
||
.. _Title-of-Troubleshooting-Procedure-Procedure-Steps: | ||
|
||
Procedure Steps | ||
=============== | ||
.. This section should include the procedure. There is no strict formatting or structure required for procedures. It is left to the authors to decide which format and structure is most relevant. | ||
.. In the case of more complicated procedures, more sophisticated methodologies may be appropriate, such as multiple section headings or a list of linked procedures to be performed in the specified order. | ||
.. For highly complicated procedures, consider breaking them into separate procedure. Some options are a high-level procedure with links, separating into smaller procedures or utilizing the reST ``include`` directive <https://docutils.sourceforge.io/docs/ref/rst/directives.html#include>. | ||
.. In general the steps are: | ||
#. Identify which **CCS subsystem triggered** the problem | ||
#. Review the **raised alerts and/or log files**, and determine whether: | ||
#. this was a transitory problem that can be documented (via a JIRA ticket) and reset, | ||
#. or it is something that requires a camera expert to diagnose. | ||
#. Clear the raised alerts in both the CCS subsystem which triggered the problem and the Master Control Module (MCM) which tracks the overall camera state. | ||
#. Clear the fault in the ocs-bridge, and switch it back to ``OFFLINE_AVAILABLE`` mode. | ||
|
||
.. note:: | ||
In either case it is important that an **OBS ticket** be created so we can track how often specific problems occur, and whether software or hardware changes are needed to prevent future occurrences. | ||
|
||
Specific CCS commands for performing these operations are documented below. | ||
|
||
|
||
.. _Title-of-Troubleshooting-Procedure-tracking-down-a-CSC-problem: | ||
|
||
|
||
Tracking down a CCS problem | ||
-------------------------------- | ||
In general there are two approaches to tracking down a CCS problem: | ||
using the **ccs-shell** command line tool, or using the **ccs-console** graphical interface. | ||
Currently only the first approach is described here. | ||
|
||
|
||
|
||
.. warning:: | ||
Pending **TODO**: Simulate a fault and verify these commands are correct (perhaps on TTS) (plus highlight responses) | ||
|
||
.. This note was added to be able to copy the commands without cs. BUT UNCERTAIN WHETHER IT IS CORRECT OR NEEDED. | ||
.. admonition:: Important | ||
|
||
The following commands are entered at the ``ccs>`` prompt. | ||
|
||
#. Identify which CCS subsystem triggered the problem: | ||
|
||
.. code-block:: bash | ||
ats-mcm getRaisedAlertSummary | ||
#. Review the raised alerts and log files: | ||
|
||
.. code-block:: bash | ||
ats-fp getRaisedAlertSummary | ||
#. Clear the alerts: | ||
|
||
.. code-block:: bash | ||
ats-fp clearAllAlerts | ||
ats-fp getRaisedAlertSummary | ||
ats-mcm clearAllAlerts | ||
ats-mcm getRaisedAlertSummary | ||
#. Clear the fault in the ocs-bridge and switch it back to ``OFFLINE_AVAILABLE``: | ||
|
||
.. code-block:: bash | ||
ats-ocs-bridge clearFault | ||
ats-ocs-bridge setAvailable | ||
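
The command sequence above can be summarized in a small helper, shown here only as a sketch. It does not talk to CCS; it just assembles, in order, the command lines listed in this procedure so they can be reviewed or pasted at the ``ccs>`` prompt. The function name ``recovery_commands`` and its ``faulted_subsystem`` parameter are hypothetical and introduced for illustration; the default ``ats-fp`` is the subsystem used in the example commands, and the subsystem actually reported by ``ats-mcm getRaisedAlertSummary`` may differ.

```python
from typing import List

# Names taken from the commands in this procedure.
MCM = "ats-mcm"                # Master Control Module, tracks overall camera state
OCS_BRIDGE = "ats-ocs-bridge"  # bridge whose fault is cleared last


def recovery_commands(faulted_subsystem: str = "ats-fp") -> List[str]:
    """Return the ccs-shell command lines of the procedure, in order.

    `faulted_subsystem` is an illustrative parameter: the CCS subsystem
    that raised the alert (assumed here to be "ats-fp", as in the example).
    Nothing is executed; the caller decides how to run the commands.
    """
    return [
        # 1. Identify which CCS subsystem triggered the problem.
        f"{MCM} getRaisedAlertSummary",
        # 2. Review the raised alerts on that subsystem.
        f"{faulted_subsystem} getRaisedAlertSummary",
        # 3. Clear the alerts on the subsystem, then on the MCM,
        #    re-checking the summary after each clear.
        f"{faulted_subsystem} clearAllAlerts",
        f"{faulted_subsystem} getRaisedAlertSummary",
        f"{MCM} clearAllAlerts",
        f"{MCM} getRaisedAlertSummary",
        # 4. Clear the ocs-bridge fault and make it available again.
        f"{OCS_BRIDGE} clearFault",
        f"{OCS_BRIDGE} setAvailable",
    ]


if __name__ == "__main__":
    for command in recovery_commands():
        print(command)
```

Keeping the subsystem as a parameter reflects the first step of the procedure: the alerts must be cleared in whichever subsystem triggered the problem before they are cleared in the MCM.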
.. _Title-of-Troubleshooting-Procedure-Post-Condition: | ||
|
||
Post-Condition | ||
============== | ||
|
||
.. This section should provide a simple overview of conditions or results after executing the procedure; for example, state of equipment or resulting data products. | ||
.. It is preferred to include them as a bulleted or enumerated list. | ||
.. Please provide screenshots of the software status or relevant display windows to confirm. | ||
.. Do not include actions in this section. Any action by the user should be included in the end of the Procedure section below. For example: Do not include "Verify the telescope azimuth is 0 degrees with the appropriate command." Instead, include this statement as the final step of the procedure, and include "Telescope is at 0 degrees." in the Post-condition section. | ||
- AT Camera can now be set to the ``ENABLED`` state. | ||
|
||
|
||
.. _Title-of-Troubleshooting-Procedure-Contingency: | ||
|
||
Contingency | ||
=========== | ||
|
||
If the procedure was not successful, report the issue in `#summit_auxtel`_ and/or activate the :ref:`Out of hours support <Safety-out-of-hours-support>`. | ||
|
||
.. _#summit_auxtel: https://lsstc.slack.com/archives/C01K4M6R4AH | ||
|
Binary file added
BIN
+265 KB
.../AuxTel-Troubleshooting/LATISS-Troubleshooting/_static/1_power_cycle_ATSpec.png
Binary file added
BIN
+446 KB
...l-Troubleshooting/LATISS-Troubleshooting/_static/2_connect_auxtel_EUI_part1.png
Binary file added
BIN
+427 KB
...l-Troubleshooting/LATISS-Troubleshooting/_static/3_connect_auxtel_EUI_part2.png
Binary file added
BIN
+1.16 MB
...uxTel-Troubleshooting/LATISS-Troubleshooting/_static/4_ACE_spec_EUI_Labview.png