Add Logging Statements for Performance Analysis #137

TeachMeTW · 2024-10-02T18:43:40Z

Description

Introduces logging statements to various functions primarily located in utils.py and data page callbacks -- more to be added. The logging format has been standardized to provide clarity and consistency, structured as follows:

Format:
[Component] - [Function/Callback Name] - [Stage] - [Details]

Example:
Utils - Trajectories Stage 1 - Execution Time: 150ms

The primary goal of these logging enhancements is to facilitate the identification of performance bottlenecks and to enable the export of timing data for further analysis.

Changes Made

Added logging statements to key functions in utils.py.
Implemented logging in relevant data page callbacks.
Each log entry includes:
- The component (e.g., Utils or Callback)
- The specific function or callback name
- A stage identifier
- Relevant details, such as execution time

Rationale

By incorporating these logging statements, we aim to:

Gain better visibility into the execution time of critical functions.
Identify performance bottlenecks within the application.
Facilitate data collection for performance analysis.

Testing

Verified that logging statements are triggered at the appropriate stages in the application.
Confirmed that logs are formatted correctly and contain the expected information.

Next Steps:

Add as stats to emission

JGreenlee

@shankari Any suggestions for cleaner ways to instrument the execution of specific sections of code?

My first thought was to create a decorator which records and prints execution time of a function. But this would only work on a per-function basis, and @TeachMeTW thinks that a more granular, per-"Stage" measurement will be necessary.

@TeachMeTW is also wondering if "Time for : x seconds" is adequate or if each of these log statements should include an ISO string.
I thought that logging already included timestamps – is it just a matter of configuring it to show them?

shankari · 2024-10-02T19:43:30Z

Why are we only adding the times as logs? We should put them in as stats (e.g. add_dashboard_stats, similar to add_server_stats so that we can analyze them without having to download and parse logs. Downloading logs from the production environment is particularly challenging. I am fairly sure that is what we discussed at yesterday's meeting.

@JGreenlee the canonical way to instrument specific sections of code in python is to use a Timer in a with block.
You can see how i have used it in the server

        with ect.Timer() as uit:
            logging.info("*" * 10 + "UUID %s: updating incoming user inputs" % uuid + "*" * 10)
            print(str(arrow.now()) + "*" * 10 + "UUID %s: updating incoming user inputs" % uuid + "*" * 10)
            eaum.match_incoming_user_inputs(uuid)

        esds.store_pipeline_time(uuid, ecwp.PipelineStages.USER_INPUT_MATCH_INCOMING.name,
                                 time.time(), uct.elapsed)

I would suggest using the same Timer object.

TeachMeTW · 2024-10-02T19:49:46Z

@shankari I wanted to clarify; would I be using

def store_stats_entry(user_id, metadata_key, name, ts, reading):
  data = {
    "name": name,
    "ts": ts,
    "reading": reading
  }
  new_entry = ecwe.Entry.create_entry(user_id, metadata_key, data)
  return esta.TimeSeries.get_time_series(user_id).insert(new_entry)

also I had a question with the server implementation. The with ect.Timer() as uit is not used. UCT is used. Is this a typo? uct was defined 2 statements above the with statement you provided

shankari · 2024-10-02T20:02:06Z

@TeachMeTW

I am suggesting that you create a new store_dashboard_time function similar to store_pipeline_time function, which will use store_stats_entry internally
ah I think that is an error in my code. Thanks for catching it. Please fix it now!

This also means that all the USER_INPUT_MATCH_INCOMING times are incorrect.

TeachMeTW · 2024-10-02T20:11:52Z

@shankari Sounds good, I will implement both 1) and 2) on e-mission/e-mission-server#986

TeachMeTW added 3 commits October 2, 2024 11:43

Timing

4100030

reverted comment removal

dd42473

naming standardization

9ff2f94

TeachMeTW changed the title ~~Timing~~ Add Logging Statements for Performance Analysis Oct 2, 2024

JGreenlee reviewed Oct 2, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Logging Statements for Performance Analysis #137

Add Logging Statements for Performance Analysis #137

TeachMeTW commented Oct 2, 2024 •

edited

Loading

JGreenlee left a comment •

edited

Loading

shankari commented Oct 2, 2024

TeachMeTW commented Oct 2, 2024

shankari commented Oct 2, 2024 •

edited

Loading

TeachMeTW commented Oct 2, 2024

Add Logging Statements for Performance Analysis #137

Are you sure you want to change the base?

Add Logging Statements for Performance Analysis #137

Conversation

TeachMeTW commented Oct 2, 2024 • edited Loading

Description

Changes Made

Rationale

Testing

Next Steps:

JGreenlee left a comment • edited Loading

Choose a reason for hiding this comment

shankari commented Oct 2, 2024

TeachMeTW commented Oct 2, 2024

shankari commented Oct 2, 2024 • edited Loading

TeachMeTW commented Oct 2, 2024

TeachMeTW commented Oct 2, 2024 •

edited

Loading

JGreenlee left a comment •

edited

Loading

shankari commented Oct 2, 2024 •

edited

Loading