-
Notifications
You must be signed in to change notification settings - Fork 8.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Beyond word clouds #95912
Comments
Pinging @elastic/kibana-app (Team:KibanaApp) |
Agree, esp. if there's a need to show relations and not just allude to term frequencies, and there's enough space for the links and proximity layout to work. Word cloud is indeed a bit like the pie chart, can even be circular :-) elastic/elastic-charts#1038 |
Even for the simpler case of plain lists a bar chart can be clearer, as noted in that article. However we can do better than plain bar charts when it comes to lists of significant terms found in query results. Everything is drawn to scale and the zoom bar can be used to reveal details of minor celebrities and the percentage of their mentions that occur in the search results.
Currently word clouds use one stat to size words and any comparisons are hard because long words use more space than short words. |
Thank you for contributing to this issue, however, we are closing this issue due to inactivity as part of a backlog grooming effort. If you believe this feature/bug should still be considered, please reopen with a comment. |
Word cloud visualizations are nice eye candy but for practical use have a number of issues. They have been called "the pie chart of text data" and I find it hard to disagree.
Making sense of popular/significant terms can be greatly improved if we also provide a degree of clustering in the visualization.
As a real-world and topical example, here are the significant words generated from today's news headlines and rendered as a typical word cloud (using this):
The user is left wondering if Joe Biden's dog has anything to do with the Suez Canal and if Deliveroo drivers have been involved in a biting incident. If we use the adjacency matrix aggregation we can cluster these same terms by their co-occurrence and use a Graph visualization to give a much more useful summary of today's news:
We can clearly see that it was
Biden's dog
in thebiting incident
and that it was theever given
megaship
stuck in theSuez canal
. In my prototype these relationship lines that connect terms can also be clicked and a highlighter can be used to show where the connected terms were used in the original text:This style of interaction helps users quickly remove the mystery by providing the missing context.
Even if we don't adopt a graph visualization, the clusters produced by the adjacency matrix aggregation can be of use in colouring words based on the clusters they sit in.
It's also worth mentioning again that text fields are currently not supported in word cloud visualizations and the significant_text aggregation was specifically designed for producing these sorts of word discoveries from
text
fields, with special support for eliminating junk words from noisy text.The text was updated successfully, but these errors were encountered: