This proposal has been accepted to be implemented in the main repository. Check for updates in the comments

Improvement of the Decidim’s metrics consistency throughout the application

Main repo (accepted)

Deleted participant 20/01/2023 11:21

Is your feature request related to a problem? Please describe

When I want to find out how many proposals exist on an existing platform (in this example I'll take https://ecrivons.angers.fr/). In search, it diplays that there is about 2631 proposals. However, when I look down on the front-office, it displays a total of 2331. Finally, if I go to the back-office, this number is 2575 (these numbers may change a bit in the future, but the gap is here).

The inference of this is that Decidim has multiple ways to mesure what's going on a platform.

Decidim::Search : when searching an empty string in the main search box of a Decidim Instance, we use the Decidim::Search namespace which use a complex system of hooks triggering computations that may not be ended when the page is displayed.
Decidim::Stats : statistics displayed on front-office, they are computed in real time.
Decidim::Metrics: the main concern of this proposal, displayed in back-office, these are the most detailed ones. Currently, they are computed, like the Decidim::Search with a Rake task. This task run every 24h by default, which can be unclear in periods of high participation (a voting phase of a budget as instance, with dozens/hundreds of votes every day).

It creates many misunderstandings among administrators, which are concerned about what number is the right one.

I took time to look at the Decidim::Metrics, since it appeared to be the most complete Ruby namespace on the subject among the three. What I found is that even for metrics that seems quite similar (proposals and “proposals accepted”), the way they are counted can be very divergent.

Let me give one example : there are 2 indicators that count proposals : a “total” one and a second that counts only accepted proposals.

The way total proposals are currently counted is the following :

we harvest all participatory spaces (published or not)
we harvest all published components of these participatory spaces
we harvest all proposals that belong to one of these components
… and we remove the followings :
proposals that are withdrawn (therefore, remaining proposals respect the condition state != withdrawn)
proposals that are moderated
proposals that aren’t published (keep it mind, it’ll make sense just after)

Now, when counting accepted proposals :

the harvesting process is similar, that’s consistent…
… and we remove the followings :
proposals that are accepted (therefore, remaining proposals respect the condition state == accepted
… and that’s all : proposals that are moderated are also counted, and proposals that aren’t published too, which could lead to a state where there could be a greater number of accepted proposals than the number of proposals.

Describe the solution you'd like

A clear description in the documentation of how statistics are computed throughout the application and why.
Rethink the current implementation of how these numbers are computed in order to have a consistent counting for each object (ie. proposals are counted the same way comments are counted ie. objects that are published that belong to a published component and not moderated, period). If not, detail it either in the admin panel or in the documentation and explain this behavior.
In a more bold but time-consuming way, we could at the same time have options that allows admin to modify the behavior of metrics (like a tickbox that allow counting of moderated objects, another tickbox that allow counting of unpublished objects etc.).

Describe alternatives you've considered

We currently use an external software (Metabase) to make our visualizations, but organizations do not have all the resource of a full-time data analyst to understand these subjects.

Additional context

Does this issue could impact on users private data?
Since it’s a rewriting of current visualizations, it normaly does affect but the current way data are communicated on the platform.

Funded by

Documentation work can be done by Open Source Politics
No funding (yet) for the correction of measures

Filter results for: Awaiting funding

Comment

Avatar: Xabier

Avatar: Pierre Mesure

Avatar: Pau Parals

Avatar: Open Source Politics

Avatar: Pauline Bessoles

Avatar: Simonas Zilinskas

Liked by Xabier and 9 more

Liked by

Avatar: Xabier Xabier Decidim Member

Avatar: Pierre Mesure Pierre Mesure

Avatar: Pau Parals Pau Parals Decidim Member

Avatar: Open Source Politics Open Source Politics Official participant

Avatar: Pauline Bessoles Pauline Bessoles Decidim Member

Deleted participant Decidim Member

Avatar: Simonas Zilinskas Simonas Zilinskas Decidim Member

Avatar: Romy Grasgruber-Kerl Romy Grasgruber-Kerl Decidim Member

Avatar: Lucie Grau Lucie Grau

1 comment

Fingerprint

The piece of text below is a shortened, hashed representation of this content. It is useful to ensure the content has not been tampered with, as a single modification would result in a totally different value.

Value: 130cafefcbbffd3708bfd39ab7db07e856779cf9d011418d06a77c8c14ac61f8

Source:

{"body":{"en":"<p><strong>Is your feature request related to a problem? Please describe</strong></p><p>When I want to find out how many proposals exist on an existing platform (in this example I'll take <a href=\"https://ecrivons.angers.fr\" rel=\"noopener noreferrer\" target=\"_blank\">https://ecrivons.angers.fr</a>/). In search, it diplays that there is about <strong>2631</strong> proposals. However, when I look down on the front-office, it displays a total of <strong>2331</strong>. Finally, if I go to the back-office, this number is <strong>2575</strong> (these numbers may change a bit in the future, but the gap is here).</p><p>The inference of this is that Decidim has multiple ways to mesure what's going on a platform.</p><ul><li><strong>Decidim::Search</strong> : when searching an empty string in the main search box of a Decidim Instance, we use the Decidim::Search namespace which use a complex system of hooks triggering computations that may not be ended when the page is displayed.</li><li><strong>Decidim::Stats</strong> : statistics displayed on front-office, they are computed <strong>i</strong>n real time.</li><li><strong>Decidim::Metrics</strong>: the main concern of this proposal, displayed in back-office, these are the most detailed ones. Currently, they are computed, like the Decidim::Search with a Rake task. This task run every 24h by default, which can be unclear in periods of high participation (a voting phase of a budget as instance, with dozens/hundreds of votes every day).</li></ul><p>It creates many misunderstandings among administrators, which are concerned about what number is the right one.</p><p>I took time to look at the Decidim::Metrics, since it appeared to be the most complete Ruby namespace on the subject among the three. What I found is that even for metrics that seems quite similar (proposals and “proposals accepted”), the way they are counted can be very divergent.</p><p>Let me give one example : there are <strong>2 indicators that count proposals</strong> : a “total” one and a second that counts only accepted proposals.</p><p>The way <strong>total proposals</strong> are currently counted is the following :</p><ul><li>we harvest all participatory spaces (published or not)</li><li>we harvest all published components of these participatory spaces</li><li>we harvest all proposals that belong to one of these components</li><li>… and we remove the followings :</li><li class=\"ql-indent-1\">proposals that are withdrawn (therefore, remaining proposals respect the condition <code>state != withdrawn</code>)</li><li class=\"ql-indent-1\">proposals that are moderated</li><li>proposals <strong>that aren’t published</strong> (keep it mind, it’ll make sense just after)</li></ul><p>Now, when counting <strong>accepted proposals</strong> :</p><ul><li>the harvesting process is similar, that’s consistent…</li><li>… and we remove the followings :</li><li class=\"ql-indent-1\">proposals that are accepted (therefore, remaining proposals respect the condition <code>state == accepted</code></li><li>… <u>and that’s all </u>: proposals that are moderated are also counted, and proposals that aren’t published too, <strong>which could lead to a state where there could be a greater number of accepted proposals than the number of proposals</strong>.</li></ul><p><strong>Describe the solution you'd like</strong><br></p><ul><li>A clear description in the documentation of how statistics are computed throughout the application and why.</li><li>Rethink the current implementation of how these numbers are computed in order to have a consistent counting for each object (ie. proposals are counted the same way comments are counted ie. objects that are published that belong to a published component and not moderated, period). If not, detail it either in the admin panel or in the documentation and explain this behavior.</li><li>In a more bold but time-consuming way, we could at the same time have options that allows admin to modify the behavior of metrics (like a tickbox that allow counting of moderated objects, another tickbox that allow counting of unpublished objects etc.).</li></ul><p><strong>Describe alternatives you've considered</strong></p><p>We currently use an external software (<a href=\"/701ce365d81b4693aaa5895a57a85218\" rel=\"noopener noreferrer\" target=\"_blank\">Metabase</a>) to make our visualizations, but organizations do not have all the resource of a full-time data analyst to understand these subjects.</p><p><strong>Additional context</strong></p><p><strong>Does this issue could impact on users private data?</strong><br>Since it’s a rewriting of current visualizations, it normaly does affect but the current way data are communicated on the platform.</p><p><strong>Funded by</strong><br></p><ul><li>Documentation work can be done by Open Source Politics</li><li>No funding (yet) for the correction of measures</li></ul>"},"title":{"en":"Improvement of the Decidim’s metrics consistency throughout the application"}}

This fingerprint is calculated using a SHA256 hashing algorithm. In order to replicate it yourself, you can use an MD5 calculator online and copy-paste the source data.

Essential

Preferences

Analytics and statistics

Marketing

Improvement of the Decidim’s metrics consistency throughout the application

Liked by

Please log in

Cookie settings

Essential

Preferences

Analytics and statistics

Marketing

Improvement of the Decidim’s metrics consistency throughout the application

Share

QR Code