Fixed the pagination problem #4042

okonek · 2018-11-25T19:34:56Z

Fixes #2258 (<=== Add issue number here)

Make sure these boxes are checked before your pull request (PR) is ready to be reviewed and merged. Thanks!

tests pass -- look for a green checkbox ✔️ a few minutes after opening your PR -- or run tests locally with rake test
code is in uniquely-named feature branch and has no merge conflicts
PR is descriptively titled
ask @publiclab/reviewers for help, in a comment below

We're happy to help you get this ready -- don't be afraid to ask for help, and don't be discouraged if your tests fail at first!

If tests do fail, click on the red X to learn why by reading the logs.

Please be sure you've reviewed our contribution guidelines at https://publiclab.org/contributing-to-public-lab-software

Thanks!

plotsbot · 2018-11-25T19:46:18Z

	1 Warning
⚠️	It looks like you merged from master in this pull request. Please rebase to get rid of the merge commits – you may want to rewind the master branch and rebase instead of merging in from master, which can cause problems when accepting new code!

	2 Messages
📖	@okonek Thank you for your pull request! I’m here to help with some tips and recommendations. Please take a look at the list provided and help us review and accept your contribution! And don’t be discouraged if you see errors – we’re here to help.
📖	It looks like you haven’t marked all the checkboxes. Help us review and accept your suggested changes by going through the steps one by one. If it is still a ‘Work in progresss’, please include ‘[WIP]’ in the title.

Generated by 🚫 Danger

jonxuxu · 2018-11-25T19:46:40Z

Hey there, thanks for making this pull request! It's a good solution to the issue, but there are some issues that we should fix in the codeclimate so that the checks pass. I'll request some of them in the code.

okonek · 2018-11-25T19:47:55Z

The mechanics of sorting are completely rewritten and the sort operation now happens in the backend, so with every filter change the website is reloaded.

okonek · 2018-11-25T19:50:17Z

Ok, I'll try to solve them.

oorjitchowdhary · 2018-11-25T19:51:47Z

I believe if the Travis Ci and Danger checks have passed, we may be able to ignore the CodeClimate issues..
@publiclab/reviewers Shall tell the exact steps..
@okonek Thanks for your PR.

okonek · 2018-11-25T20:13:04Z

Now that the codeclimate tests pass I think there is no problem with the PR.

sashadev-sky · 2018-11-25T21:08:33Z

@okonek I am having issues passing Travis CI tests and I saw in their pull requests bank that you had a similar error to mine which you managed to fix:

The command "if [ "$TRAVIS_PULL_REQUEST" != "false" ]; then danger --verbose; fi" exited with 1

I was wondering if you could tell me what was causing this error for you / how you fixed it please!

okonek · 2018-11-26T12:28:33Z

Could you please link this PR, because I don't remember where did I have this issue?

okonek · 2018-11-26T12:51:19Z

Ok, just added another commit to this PR, because I just realised a small issue, that is now resolved. Could someone please review my task on GCI?

jywarren · 2018-11-26T20:56:25Z

Wow this is a pretty involved PR! Thanks! Would anyone from @publiclab/reviewers or @publiclab/mentors be able to give it a close review? Thanks!!!!

okonek · 2018-11-27T14:40:41Z

Yes, I worked pretty hard on that one. Please review. I'm waiting for two days.

grvsachdeva · 2018-11-27T22:44:25Z

Gemfile.lock

@@ -60,7 +60,6 @@ GEM
      scrypt (>= 1.2, < 4.0)
    authlogic-oid (1.0.4)


Please remove Gemfile.lock from your PR, as we update it only when required. Thanks!

jywarren · 2018-11-27T22:44:59Z

Hi, @okonek - apologies, it takes a bit to review a longer PR, and it looks like many mentors are tied up in exams at the moment. We appreciate your patience.

@gauravano perhaps we should push this to unstable to test it out live, as well?

grvsachdeva · 2018-11-27T22:45:10Z

app/assets/javascripts/dashboard.js

@@ -1,15 +1,19 @@
+/* eslint-disable complexity */
+/* eslint-disable wrap-iife */


I don't think these lines are needed.

grvsachdeva · 2018-11-27T22:47:18Z

Agreed @jywarren, I am pushing it on unstable.

jywarren · 2018-11-27T23:10:07Z

Should be testable here: https://unstable.publiclab.org/dashboard

jywarren · 2018-11-27T23:11:05Z

Oh, odd... @icarito did something more change in our subdomains? I can't seem to load unstable as it goes directly to Jenkins...

grvsachdeva · 2018-11-27T23:12:21Z

Let's see how it works after the build finishes .

jywarren · 2018-11-27T23:28:04Z

app/assets/javascripts/dashboard.js

    }
+    var baseurl = window.location.href;
+    url = new URL(baseurl);


Hi, reading through this carefully, it's very well crafted, and thank you!

One thing that was true about the previous version was that the types shown did not have to be specified in the URL. I'm thinking about the typical use case of someone coming to /dashboard -- does this code refresh the page in order to send that information to the controller? I am not sure that's the perfect solution here but I'm trying to understand how you've approached the problem.

If I'm misunderstanding, could you add some comments to explain the sequence, as the comments had been in the original version? I know it's some extra work, but this is a complex interaction here and we definitely want to get it right and make sure it's readable by future developers too!

Thanks so much for your work on this. We'll hopefully get the 'unstable' branch material working again so it'll be easier to review, too.

grvsachdeva · 2018-11-27T23:32:31Z

Unstable is running as build finished.

jywarren · 2018-11-27T23:34:37Z

oh oh oh WOW it's a new feature! Showing Jenkins until it's done!!! Wow!!!

grvsachdeva · 2018-11-27T23:43:31Z

I think I got your approach. Currently, we take all the notes, wikis, comments and then go to dashboard, and pagination show count totaling the sum. When user try to show notes only, or wikis only, other nodes get invisible 👓 but count remains same. For instance, suppose there are 1000 total notes displaying over 100 pages. And, there are 10 events in them, a event can be anywhere 8th page, 9th page.... Now, if we wish to show only events, other nodes(notes, wikis, questions get invisible) and only events show but on their same positions i.e., on 8th page, 9th page, etc and other pages appear empty with same count of pages because in reality other nodes are there only.

So, you are reloading page in this PR and loading the content which is checked. Am I right?

jywarren · 2018-11-27T23:54:14Z

What if we started showing the type only once someone begins clicking the pagination... So normally at /dashboard it would show pagination for all, but once you start going back to older pages like page=2, it begins to use your filtering to show more accurate pagination by also appending types=comments,questions, etc? It also isn't perfect but it's a good compromise and also a good adaptation of the code that's now been written?

…

On Tue, Nov 27, 2018, 6:44 PM Gaurav Sachdeva ***@***.*** wrote: I think I got your approach. Currently, we take all the notes, wikis, comments and then go to dashboard, and pagination show count totaling the sum. When user try to show notes only, or wikis only, other nodes get invisible 👓 but count remains same. For instance, suppose there are 1000 total notes displaying over 100 pages. And, there are 10 events in them, a event can be anywhere 8th page, 9th page.... Now, if we wish to show only events, other nodes(notes, wikis, questions get invisible) and only events show but on their same positions i.e., on 8th page, 9th page, etc and other pages appear empty with same count of pages because in reality other nodes are there only. So, you are reloading page in this PR and loading the content which is checked. Am I right? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#4042 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AABfJyHaD4MlRkIImXPUAxqIQQbYRsLrks5uzc4jgaJpZM4YyBuC> .

grvsachdeva · 2018-11-28T00:04:22Z

I am testing the new dashboard on unstable and I feel there are some issues:

Everytime checkbox is clicked, the whole page loads
Checkbox having value ALL not working
I think loading is a issue

@jywarren do you think we should change layout of dashboard something like tabs on profile page - https://publiclab.org/profile/warren . Do you think it can be discussed in Openhour?

okonek · 2018-11-30T21:27:02Z

@jywarren And? I know you are busy, but I wait for very long.

jywarren · 2018-11-30T22:08:10Z

Hi, @okonek - thanks for your patience. The main reason I prefer the solution you've finished (as mentioned in this comment, and adding the "No results - try broadening your filter to see more." message. First, it doesn't require a page refresh upon arriving at /dashboard if there are selections made (the javascript just displays the correct types). I think the page refreshing will be pretty disruptive to usability, i'm sorry to say! Would you please add the message above, and we can merge this?

The only other solution I can think of is to store the settings in the session, or in the database, so that the server side can "know" to display the right types, and we wouldn't have to rely on JavaScript. But the JavaScript solution has been in place for a long time and people haven't had a major problem with it, so I think we might as well stick with what works rather than introduce a page refresh that happens a lot.

Thanks for working on this one, I think we can wrap it up now!

jywarren · 2018-11-30T23:07:32Z

Yes, that's fine with me! Although I've love to see the latest code Jan added. Thanks!

…

On Fri, Nov 30, 2018 at 5:24 PM Gaurav Sachdeva ***@***.***> wrote: I think the GCI task can be approved based on the proposed solution. What do you think @jywarren <https://github.com/jywarren> ? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#4042 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AABfJ1ctaAPGGdnc-rL8s2_BL_KLCWZrks5u0a_0gaJpZM4YyBuC> .

okonek · 2018-12-01T10:51:58Z

@jywarren I think I resolved the issue perfectly now. With small changes page now fetches filtered data using AJAX, so there is no reloading. I also add a gif of how it works now. If there are no code errors, I think it is mergable.
Thanks for all your help.

okonek · 2018-12-01T12:16:09Z

I also resolved the issue of filtering questions.

app/assets/javascripts/dashboard.js

jywarren · 2018-12-01T17:11:27Z

app/assets/javascripts/dashboard.js

    $('.activity .col-md-6').css('display', '');
    getLocalStorageActivity();
-
  });


  $('.activity-dropdown .dropdown-toggle').click(function(e) {


OK, this is now empty - shall we remove it?

What do you mean? What should be removed?

jywarren · 2018-12-01T17:31:33Z

app/controllers/home_controller.rb

+    basenotes = basenotes.where('nid != (?)', blog.nid) if blog
+    notes = basenotes
+
+    questions = Tag.find_nodes_by_type('question:question', 'note', 999_999_999_999_999)


Here you should be able to pass nil and the limit will be lifted. But unfortunately, this will query the database for all question nodes, and likewise below all event nodes and above all notes -- on the production server this will be thousands of records, so we need to optimize for it to work. The .page() method is quite sophisticated, and uses limit() and offset() to select just the right records to display on the given page. But the problem we're encountering here is that we are trying to subselect across different tables. I don't believe the will_paginate gem can do this.

What we could do here is to start by fetching the paginated notes, by leaving in .page() for that query. For events and questions, which are also nodes, we could try to filter them in/out if they are not selected... this could be difficult but we could re-use some of the code in Tag.find_nodes_by_type, maybe...

Then, we could use the timestamps from the nodes we fetched to select the comments which were made during the same period. These could be added in, and while the calculation of total # of pages would be correct, the number of items per page would vary since we would be calculating # of pages based only on nodes.

I'm trying to think of how this could be done more simply but it was already a pretty complex thing to display in the first place... the critical part now is to not return the entire table's worth of each type on each page load. This works on a very small dataset but would almost certainly crash the server when 1000s of records are returned each time in production.

https://stackoverflow.com/a/7959348 suggests synchronizing the offset, but then over time the comments and nodes would drift because they don't occur at the same rate over time.

OK, here's an idea. I don't know if it's good, but just to think through it...

Let's simplify - let's put wiki edits permanently in the sidebar and display them below in mobile view, so we just don't worry about wikis as part of this. Then we're left with Notes, Events, Questions (all types of Node) and Comments.

The first three we could probably filter if we wanted based on what tags they have, using joins. But Comments messes us up, because it's a different table (see my above comment). What if we, instead of showing pure comments, showed only Nodes in Activity, but we joined comments where they exist (let's forget Answer comments for now) and we sorted by comment.timestamp and then node.created: https://api.rubyonrails.org/classes/ActiveRecord/QueryMethods.html#method-i-order -- would that work? And then in the views, we detect if there is a comment attached and we display the X commented on Y Z time ago instead of the usual note template?

This isn't air-tight, but seems plausible. I'm sorry I missed the change in pagination from the very first commit. I'm going to think on this more and happy to hear your thoughts too.

In the above scenario, the right type of join will return multiple duplicate Node records, but one for each comment, and ordered by comment timestamp. I think that's fine, because each would really be a "stand in" for the associated comment record. But alternatively we could try to use a join type which would return only unique/discrete Node records, but choose the most recent associated comment record. Then we could show the Node template on the dashboard, and kind of "attach" the most recent comment to the bottom of it, so as to show why it's appearing in the timeline at that point. Just thinking through possibilities here.

Finally, because this is already quite complex, we could stop offering to filter by Event and Question, simply grouping all Nodes together. That would simplify things a good bit, leaving only All, "Posts" (we'll call them?), and Comments. Trying to think of how to make this a reasonable path forward.

You know what? I have a simpler idea. I know it's not that great but it's better than fetching thousands. We could just fetch the first 37 from all of the tables, because it's the nodes limit for one page. You know what I mean? It's a temporary solution, but it's not that bad and it could be solved in another PR.

I think this is doable, and I agree with the principle of doing something now and following up in later PRs. But this would start to go out of time sync pretty fast because there are more comments per time period than nodes. Could we try the simpler solution but using timestamps instead of record count?

So you'd narrow your query with a where() and I think you can even pass a range to where() in activerecord... then it'd be like .where(timestamp: basenotes.first.created..basenotes.last.created) i think?

jywarren · 2018-12-01T18:05:46Z

@publiclab/reviewers @publiclab/mentors we ran into a REALLY tough issue in @okonek's PR here... a true puzzle trying to develop some solutions for. @okonek has done some tremendous work on the problem but it's revealed an even larger challenge for database query optimization on the dashboard. If anyone likes that kind of problem, we'd be grateful for some ideas or input; see my comment at: https://github.com/publiclab/plots2/pull/4042/files#r238070635

okonek · 2018-12-01T18:07:56Z

@jywarren Could you accept limiting every query to 37 nodes? The maximum node count per page? Please, I'd like to be done with this issue.

jywarren · 2018-12-01T18:15:18Z

Check out my comment above -- limiting by timestamp instead -- i think that is pretty good.

If you are really eager to move on to another issue, I totally understand. This one was extremely challenging, to tell the truth, especially with the added issue of database optimization. Your solution is super nice - the Ajax is really wonderful. It's just an issue which, if we could go back, perhaps should have been solved in smaller parts and architected more thoroughly.

oorjitchowdhary · 2018-12-01T18:22:28Z

Hi @okonek @jywarren, I may have not involved in the discussion here.. but I just found out that Jan has added yarn.lock in the PR which I believe is redundant here...
Apologies if I'm wrong

okonek · 2018-12-01T18:25:43Z

@jywarren Ok, I don't want to leave this issue with a bad solution. Could you better describe your idea of limiting by timestamps? Thanks

okonek · 2018-12-01T18:30:25Z

And could you tell me why limiting by 37 is bad? I mean, I can't find a case where it wouldn' work. Thanks

jywarren · 2018-12-01T18:31:30Z

Yes I will but it'll have to be tonight as I have some work now. But it's just like limiting to 37 but using a time boundary instead of a count. I can offer some more code hints tonight however. Thanks, and especially since we did not describe this particular challenge in the original issue, don't worry, you'll definitely be getting credit for this PR! Thanks again, J

…

On Sat, Dec 1, 2018, 1:26 PM Jan Okoński ***@***.*** wrote: @jywarren <https://github.com/jywarren> Ok, I don't want to leave this issue with a bad solution. Could you better describe your idea of limiting by timestamps? Thanks — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#4042 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AABfJ1pEX_6JITC34snYmULgoIjZ0lpOks5u0smngaJpZM4YyBuC> .

okonek · 2018-12-01T18:33:44Z

I'm just thinking loud, but I think that neither my solution or yours would work, because this limiting would break the pagination if we would like to parse an old site like 4, 5 etc

okonek · 2018-12-01T18:38:11Z

I really don't know what to do. I'm stuck with this issue for very long

okonek · 2018-12-01T19:22:06Z

@jywarren Any tip?

jywarren · 2018-12-01T22:18:11Z

Don't worry. I'll spend some time on this tonight.

…

On Sat, Dec 1, 2018, 2:22 PM Jan Okoński ***@***.*** wrote: @jywarren <https://github.com/jywarren> Any tip? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#4042 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AABfJ2uSTmWLwJET-q15RsAXqTUem6e2ks5u0tbfgaJpZM4YyBuC> .

jywarren · 2018-12-02T00:26:35Z

Hi @okonek -- I've spent a long time thinking about this one, and I am afraid the best way forward for now is to leave it as it is, and close this PR. I really can't think of a way to separately paginate questions, wikis, comments, answer comments, events, and notes, and any combination.

I want to apologize that the original issue made it seem that this was an easier problem than it really was, and I should have noticed that and suggested we break it into smaller parts, or solve it a different way. There's a lot of dense delicate code here, and it's not really great code either.

I would like to take steps to simplify and optimize this code. But I think there may be related issues we could work on that would be ways to do this in smaller pieces. Let me suggest a few that may be interesting to you as you now understand this code very well.

We could totally remove this caching code as it's no longer used; requests to set_activity could go directly to activity:

plots2/app/controllers/home_controller.rb

Lines 196 to 206 in 047bdb9

    
           def set_activity(source = :database) 
        
             @activity, @blog, @wikis, @revisions, = 
        
               if source == :cache 
        
                 # we no longer use activity feed on front page ('home'), so this cache may be unused 
        
                 Rails.cache.fetch("front-activity", expires_in: 30.minutes) do 
        
                   activity 
        
                 end 
        
               else 
        
                 activity 
        
               end 
        
           end

We can remove the entire lists section which is no longer used: https://github.com/publiclab/plots2/pull/4042/files#diff-398e9dc91ab41e7d183e081c72f8f161L189

I've also created four new refactoring issues which may be of interest. You've demonstrated a deep grasp of Rails code through this very challenging issue, and I want to offer you some tasks which may help you to build some real-world skills in refactoring, reorganizing, and managing larger code bases. If one of the top four here interests you, we'd love to have your help with one!

refactorization

I want to say that you've done some really impressive work here. Don't be discouraged that we weren't able to figure it out. Sometimes unexpected challenges come up as we break down a problem, and it challenges us to try something very different. We become better coders when this happens.

Maybe we'll suddenly have an idea for how to do this set of functions better. Let's keep it in mind as we move forward. There is a dashboard redesign project coming soon as well, and it could be an opportunity to try something different.

@okonek -- i'm going to give you credit for a viable solution to the original problem of pagination, even though new problems came up in the pursuit of this. As they weren't part of the original task, we'll consider that outside the scope of the challenge, although you did a great job thinking through it.

Thanks for everything! If it's all right with you, we can close this PR, but the discussions here will be a good guide if and when we come back to this problem.

okonek · 2018-12-02T09:05:01Z

Thank you very much for a tremendous help on that one. I can't say I was working on this alone. This problem is a lot bigger than anyone could thought. Anyways thank you for all your help and I am sure I've learned a lot from this issue.
John

SidharthBansal · 2018-12-06T19:54:53Z

@okonek as you have worked on this challenge for quite a long time. We want to give you credit for this challenge. You can claim the task, I will approve it.
Thanks for tremendous amount of work

SidharthBansal · 2018-12-06T19:57:34Z

@jywarren has already given you points.

okonek mentioned this pull request Nov 25, 2018

Pagination broken at /research #2258

Open

okonek added 3 commits November 25, 2018 20:30

Fixed the pagination problem

a72b9d5

Fixed some codeclimate issues

010f8cf

Removed whitespace in home_controller.rb

3394880

Removed yet another whitespace from home_controller.rb

e5a4e1c

Fixed the problem where only the checkbox in filters is clickable

704a79c

grvsachdeva reviewed Nov 27, 2018

View reviewed changes

jywarren reviewed Nov 27, 2018

View reviewed changes

okonek added 2 commits December 1, 2018 11:49

Now the filters are applied using ajax without reloading

e1b2741

Fixed codeclimate issues

4af8b80

okonek added 2 commits December 1, 2018 13:10

Fixed bad question filtering

467a6ee

Merge branch 'master' into fix_pagination_broken

1278979

jywarren reviewed Dec 1, 2018

View reviewed changes

app/assets/javascripts/dashboard.js Show resolved Hide resolved

jywarren reviewed Dec 1, 2018

View reviewed changes

Limited the number of requested nodes to 37

047bdb9

jywarren mentioned this pull request Dec 4, 2018

"Wiki" filter is displayed only in mobile view at /research #4026

Closed

jywarren closed this Dec 4, 2018

		@@ -60,7 +60,6 @@ GEM
		scrypt (>= 1.2, < 4.0)
		authlogic-oid (1.0.4)

		@@ -1,15 +1,19 @@
		/* eslint-disable complexity */
		/* eslint-disable wrap-iife */

Fixed the pagination problem #4042

Fixed the pagination problem #4042

Conversation

okonek commented Nov 25, 2018 • edited Loading

plotsbot commented Nov 25, 2018 • edited Loading

jonxuxu commented Nov 25, 2018

okonek commented Nov 25, 2018

okonek commented Nov 25, 2018

oorjitchowdhary commented Nov 25, 2018

okonek commented Nov 25, 2018

sashadev-sky commented Nov 25, 2018

okonek commented Nov 26, 2018

okonek commented Nov 26, 2018

jywarren commented Nov 26, 2018

okonek commented Nov 27, 2018

Choose a reason for hiding this comment

jywarren commented Nov 27, 2018

Choose a reason for hiding this comment

grvsachdeva commented Nov 27, 2018

jywarren commented Nov 27, 2018

jywarren commented Nov 27, 2018

grvsachdeva commented Nov 27, 2018

Choose a reason for hiding this comment

grvsachdeva commented Nov 27, 2018

jywarren commented Nov 27, 2018

grvsachdeva commented Nov 27, 2018

jywarren commented Nov 27, 2018 via email

grvsachdeva commented Nov 28, 2018

okonek commented Nov 30, 2018

jywarren commented Nov 30, 2018

jywarren commented Nov 30, 2018 via email

okonek commented Dec 1, 2018

okonek commented Dec 1, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jywarren commented Dec 1, 2018

okonek commented Dec 1, 2018

jywarren commented Dec 1, 2018

oorjitchowdhary commented Dec 1, 2018

okonek commented Dec 1, 2018

okonek commented Dec 1, 2018

jywarren commented Dec 1, 2018 via email

okonek commented Dec 1, 2018

okonek commented Dec 1, 2018

okonek commented Dec 1, 2018

jywarren commented Dec 1, 2018 via email

jywarren commented Dec 2, 2018

okonek commented Dec 2, 2018

SidharthBansal commented Dec 6, 2018

SidharthBansal commented Dec 6, 2018

okonek commented Nov 25, 2018 •

edited

Loading

plotsbot commented Nov 25, 2018 •

edited

Loading