Author stats over multiple repos #70

PriitParmakson · 2020-01-07T18:21:38Z

Wrote a script that merges the authors.json files produced by git-of-theseus-analyze, so that an authors chart can be produced over multiple repos.

erikbern · 2020-01-07T19:19:44Z

Thanks for the addition. I think this would be cleaner if the stack plotting script took multiple files on the command line. What do you think?
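A minimal sketch of what accepting multiple files on the command line could look like with the standard argparse module. The argument name `input_fns` and the example paths are hypothetical and are not taken from this PR; the sketch only illustrates the `nargs='+'` mechanism.

```python
import argparse


def build_parser():
    # Illustrative parser only; not the actual stack_plot command line.
    parser = argparse.ArgumentParser(description="Plot author stats (sketch)")
    # nargs='+' collects one or more positional values into a list.
    # 'input_fns' is a hypothetical name chosen for this example.
    parser.add_argument("input_fns", nargs="+", help="One or more authors.json files")
    return parser


# Parse an explicit argument list so the example runs without a real command line.
args = build_parser().parse_args(["repo1/authors.json", "repo2/authors.json"])
print(args.input_fns)
```

With this shape, a single file still works unchanged, since one positional value simply yields a one-element list.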

PriitParmakson · 2020-01-07T22:19:47Z

It felt safer to make it a separate script, as it's my first program in Python. I wrote the first version in Go, to satisfy my immediate need. Of course, the integrated way is cleaner. I'll look into it, perhaps in a week.

PriitParmakson · 2020-01-11T16:49:09Z

Now there seems to be a side effect. I don't know Travis.

Idea of the merging algorithm:

Each authors.json file defines a function LOC(r, a, t), where
r is the repo name,
a ∈ A(r) is an author (from the set of authors present in the repo), and
t ∈ T(r) is a time.

For the plot we need a fully defined function LOC(r, a, t), where
r ∈ R (set of all repos),
a ∈ A (union of authors of repos), and
t ∈ T (union of times of repos).

However, the authors.json files provide data only for a partially defined LOC(r, a, t). A fully defined function can be obtained by extrapolation:
If there is no data point for (r, a, t) in the authors.json file of r, then define LOC(r, a, t) = LOC(r, a, t1), where t1 is the latest timepoint with t1 < t present in the file of r; if there is no such timepoint, then LOC(r, a, t) = 0.
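The rule above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the code from this PR: it assumes each authors.json file has already been loaded into a dict with the keys `labels` (authors), `ts` (sorted timestamps that compare correctly, e.g. ISO date strings) and `y` (one LOC series per author), and it sums the filled-in LOC(r, a, t) over all repos, as a combined chart would need. The function name `merge_author_stats` is hypothetical.

```python
from bisect import bisect_right


def merge_author_stats(repos):
    """Sum per-author LOC over several repos on the union of their timestamps.

    Assumed input layout (not confirmed by the PR): each repo is a dict
    {"labels": [author, ...], "ts": [sorted timestamps], "y": [[LOC per ts] per author]}.
    """
    all_ts = sorted({t for repo in repos for t in repo["ts"]})
    all_authors = sorted({a for repo in repos for a in repo["labels"]})
    totals = {a: [0] * len(all_ts) for a in all_authors}

    for repo in repos:
        ts = repo["ts"]
        for author, series in zip(repo["labels"], repo["y"]):
            for i, t in enumerate(all_ts):
                # Latest timepoint of this repo that is <= t: an exact data
                # point if one exists, otherwise the most recent earlier one.
                k = bisect_right(ts, t) - 1
                if k >= 0:
                    totals[author][i] += series[k]
                # k < 0 means the repo has no data yet at time t: contributes 0.
    return {"labels": all_authors, "ts": all_ts, "y": [totals[a] for a in all_authors]}


repo_a = {"labels": ["ann", "bob"], "ts": ["2020-01-01", "2020-03-01"], "y": [[10, 30], [5, 0]]}
repo_b = {"labels": ["ann"], "ts": ["2020-02-01"], "y": [[7]]}
merged = merge_author_stats([repo_a, repo_b])
print(merged)
```

An author who does not appear in a repo simply contributes nothing for that repo, which matches treating the missing LOC(r, a, t) as 0.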

erikbern · 2020-01-12T00:29:40Z

this is a better approach. the code seems pretty convoluted though – feels like it shouldn't be more than 5-10 lines to accomplish what you want. you just need to add up the stats and account for the fact that the timestamps are irregular, right?

leonid-shevtsov · 2022-01-03T13:29:17Z

I don't know if it's too convoluted or not (also not a pythonist), but this PR sure helped me produce a chart for all of our repos together 👍

drew2a · 2023-11-20T15:12:49Z

> I don't know if it's too convoluted or not (also not a pythonist), but this PR sure helped me produce a chart for all of our repos together 👍

It was helpful for me as well. Thank you! @PriitParmakson

Acehood713aa · 2024-10-07T01:38:18Z

git_of_theseus/stack_plot.py

@@ -74,7 +126,8 @@ def stack_plot_cmdline():
    parser.add_argument('--max-n', default=20, type=int, help='Max number of dataseries (will roll everything else into "other") (default: %(default)s)')
    parser.add_argument('--normalize', action='store_true', help='Normalize the plot to 100%%')
    parser.add_argument('--dont-stack', action='store_true', help='Don\'t stack plot')
-    parser.add_argument('input_fn')


Acehood713aa · 2024-10-07T01:39:02Z

.travis.yml

-before_script: # configure a headless display to test plot generation
-  - "export DISPLAY=:99.0"
-  - "sh -e /etc/init.d/xvfb start"
-  - sleep 3 # give xvfb some time to start


Author stats over multiple repos

1ee92e6

Priit Parmakson added 2 commits January 11, 2020 17:49

Integrated multi-repo option into stack_plot; added two tests.

9fa9137

Put .gitignore into wrong directory. Trying again.

7596c0f

Priit Parmakson added 2 commits January 11, 2020 22:09

Updated xvfb in travis.yml to Ubuntu 16.04 (Xenial)

190201e

Removed erroneus line from __init__.py

d266a2a

erikbern mentioned this pull request Mar 2, 2023

Cumulated statics on multi-repo environement #88

Closed

drew2a mentioned this pull request Nov 20, 2023

Journal drew2a/ivory-tower#1

Open

Acehood713aa approved these changes Oct 7, 2024

Acehood713aa reviewed Oct 7, 2024
