aggregates.txt 12 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381
  1. =========================================
  2. PostgreSQL specific aggregation functions
  3. =========================================
  4. .. module:: django.contrib.postgres.aggregates
  5. :synopsis: PostgreSQL specific aggregation functions
  6. These functions are available from the ``django.contrib.postgres.aggregates``
  7. module. They are described in more detail in the `PostgreSQL docs
  8. <https://www.postgresql.org/docs/current/functions-aggregate.html>`_.
  9. .. note::
  10. All functions come without default aliases, so you must explicitly provide
  11. one. For example::
  12. >>> SomeModel.objects.aggregate(arr=ArrayAgg('somefield'))
  13. {'arr': [0, 1, 2]}
  14. .. admonition:: Common aggregate options
  15. All aggregates have the :ref:`filter <aggregate-filter>` keyword argument
  16. and most also have the :ref:`default <aggregate-default>` keyword argument.
  17. General-purpose aggregation functions
  18. =====================================
  19. ``ArrayAgg``
  20. ------------
  21. .. class:: ArrayAgg(expression, distinct=False, filter=None, default=None, ordering=(), **extra)
  22. Returns a list of values, including nulls, concatenated into an array, or
  23. ``default`` if there are no values.
  24. .. attribute:: distinct
  25. An optional boolean argument that determines if array values
  26. will be distinct. Defaults to ``False``.
  27. .. attribute:: ordering
  28. An optional string of a field name (with an optional ``"-"`` prefix
  29. which indicates descending order) or an expression (or a tuple or list
  30. of strings and/or expressions) that specifies the ordering of the
  31. elements in the result list.
  32. Examples::
  33. 'some_field'
  34. '-some_field'
  35. from django.db.models import F
  36. F('some_field').desc()
  37. .. deprecated:: 4.0
  38. If there are no rows and ``default`` is not provided, ``ArrayAgg``
  39. returns an empty list instead of ``None``. This behavior is deprecated
  40. and will be removed in Django 5.0. If you need it, explicitly set
  41. ``default`` to ``Value([])``.
  42. ``BitAnd``
  43. ----------
  44. .. class:: BitAnd(expression, filter=None, default=None, **extra)
  45. Returns an ``int`` of the bitwise ``AND`` of all non-null input values, or
  46. ``default`` if all values are null.
  47. ``BitOr``
  48. ---------
  49. .. class:: BitOr(expression, filter=None, default=None, **extra)
  50. Returns an ``int`` of the bitwise ``OR`` of all non-null input values, or
  51. ``default`` if all values are null.
  52. ``BitXor``
  53. ----------
  54. .. class:: BitXor(expression, filter=None, default=None, **extra)
  55. Returns an ``int`` of the bitwise ``XOR`` of all non-null input values, or
  56. ``default`` if all values are null. It requires PostgreSQL 14+.
  57. ``BoolAnd``
  58. -----------
  59. .. class:: BoolAnd(expression, filter=None, default=None, **extra)
  60. Returns ``True``, if all input values are true, ``default`` if all values
  61. are null or if there are no values, otherwise ``False``.
  62. Usage example::
  63. class Comment(models.Model):
  64. body = models.TextField()
  65. published = models.BooleanField()
  66. rank = models.IntegerField()
  67. >>> from django.db.models import Q
  68. >>> from django.contrib.postgres.aggregates import BoolAnd
  69. >>> Comment.objects.aggregate(booland=BoolAnd('published'))
  70. {'booland': False}
  71. >>> Comment.objects.aggregate(booland=BoolAnd(Q(rank__lt=100)))
  72. {'booland': True}
  73. ``BoolOr``
  74. ----------
  75. .. class:: BoolOr(expression, filter=None, default=None, **extra)
  76. Returns ``True`` if at least one input value is true, ``default`` if all
  77. values are null or if there are no values, otherwise ``False``.
  78. Usage example::
  79. class Comment(models.Model):
  80. body = models.TextField()
  81. published = models.BooleanField()
  82. rank = models.IntegerField()
  83. >>> from django.db.models import Q
  84. >>> from django.contrib.postgres.aggregates import BoolOr
  85. >>> Comment.objects.aggregate(boolor=BoolOr('published'))
  86. {'boolor': True}
  87. >>> Comment.objects.aggregate(boolor=BoolOr(Q(rank__gt=2)))
  88. {'boolor': False}
  89. ``JSONBAgg``
  90. ------------
  91. .. class:: JSONBAgg(expressions, distinct=False, filter=None, default=None, ordering=(), **extra)
  92. Returns the input values as a ``JSON`` array, or ``default`` if there are
  93. no values. You can query the result using :lookup:`key and index lookups
  94. <jsonfield.key>`.
  95. .. attribute:: distinct
  96. An optional boolean argument that determines if array values will be
  97. distinct. Defaults to ``False``.
  98. .. attribute:: ordering
  99. An optional string of a field name (with an optional ``"-"`` prefix
  100. which indicates descending order) or an expression (or a tuple or list
  101. of strings and/or expressions) that specifies the ordering of the
  102. elements in the result list.
  103. Examples are the same as for :attr:`ArrayAgg.ordering`.
  104. Usage example::
  105. class Room(models.Model):
  106. number = models.IntegerField(unique=True)
  107. class HotelReservation(models.Model):
  108. room = models.ForeignKey('Room', on_delete=models.CASCADE)
  109. start = models.DateTimeField()
  110. end = models.DateTimeField()
  111. requirements = models.JSONField(blank=True, null=True)
  112. >>> from django.contrib.postgres.aggregates import JSONBAgg
  113. >>> Room.objects.annotate(
  114. ... requirements=JSONBAgg(
  115. ... 'hotelreservation__requirements',
  116. ... ordering='-hotelreservation__start',
  117. ... )
  118. ... ).filter(requirements__0__sea_view=True).values('number', 'requirements')
  119. <QuerySet [{'number': 102, 'requirements': [
  120. {'parking': False, 'sea_view': True, 'double_bed': False},
  121. {'parking': True, 'double_bed': True}
  122. ]}]>
  123. .. deprecated:: 4.0
  124. If there are no rows and ``default`` is not provided, ``JSONBAgg``
  125. returns an empty list instead of ``None``. This behavior is deprecated
  126. and will be removed in Django 5.0. If you need it, explicitly set
  127. ``default`` to ``Value('[]')``.
  128. ``StringAgg``
  129. -------------
  130. .. class:: StringAgg(expression, delimiter, distinct=False, filter=None, default=None, ordering=())
  131. Returns the input values concatenated into a string, separated by
  132. the ``delimiter`` string, or ``default`` if there are no values.
  133. .. attribute:: delimiter
  134. Required argument. Needs to be a string.
  135. .. attribute:: distinct
  136. An optional boolean argument that determines if concatenated values
  137. will be distinct. Defaults to ``False``.
  138. .. attribute:: ordering
  139. An optional string of a field name (with an optional ``"-"`` prefix
  140. which indicates descending order) or an expression (or a tuple or list
  141. of strings and/or expressions) that specifies the ordering of the
  142. elements in the result string.
  143. Examples are the same as for :attr:`ArrayAgg.ordering`.
  144. Usage example::
  145. class Publication(models.Model):
  146. title = models.CharField(max_length=30)
  147. class Article(models.Model):
  148. headline = models.CharField(max_length=100)
  149. publications = models.ManyToManyField(Publication)
  150. >>> article = Article.objects.create(headline="NASA uses Python")
  151. >>> article.publications.create(title="The Python Journal")
  152. <Publication: Publication object (1)>
  153. >>> article.publications.create(title="Science News")
  154. <Publication: Publication object (2)>
  155. >>> from django.contrib.postgres.aggregates import StringAgg
  156. >>> Article.objects.annotate(
  157. ... publication_names=StringAgg(
  158. ... "publications__title",
  159. ... delimiter=", ",
  160. ... ordering="publications__title",
  161. ... )
  162. ... ).values("headline", "publication_names")
  163. <QuerySet [{
  164. 'headline': 'NASA uses Python', 'publication_names': 'Science News, The Python Journal'
  165. }]>
  166. .. deprecated:: 4.0
  167. If there are no rows and ``default`` is not provided, ``StringAgg``
  168. returns an empty string instead of ``None``. This behavior is
  169. deprecated and will be removed in Django 5.0. If you need it,
  170. explicitly set ``default`` to ``Value('')``.
  171. Aggregate functions for statistics
  172. ==================================
  173. ``y`` and ``x``
  174. ---------------
  175. The arguments ``y`` and ``x`` for all these functions can be the name of a
  176. field or an expression returning a numeric data. Both are required.
  177. ``Corr``
  178. --------
  179. .. class:: Corr(y, x, filter=None, default=None)
  180. Returns the correlation coefficient as a ``float``, or ``default`` if there
  181. aren't any matching rows.
  182. ``CovarPop``
  183. ------------
  184. .. class:: CovarPop(y, x, sample=False, filter=None, default=None)
  185. Returns the population covariance as a ``float``, or ``default`` if there
  186. aren't any matching rows.
  187. .. attribute:: sample
  188. Optional. By default ``CovarPop`` returns the general population
  189. covariance. However, if ``sample=True``, the return value will be the
  190. sample population covariance.
  191. ``RegrAvgX``
  192. ------------
  193. .. class:: RegrAvgX(y, x, filter=None, default=None)
  194. Returns the average of the independent variable (``sum(x)/N``) as a
  195. ``float``, or ``default`` if there aren't any matching rows.
  196. ``RegrAvgY``
  197. ------------
  198. .. class:: RegrAvgY(y, x, filter=None, default=None)
  199. Returns the average of the dependent variable (``sum(y)/N``) as a
  200. ``float``, or ``default`` if there aren't any matching rows.
  201. ``RegrCount``
  202. -------------
  203. .. class:: RegrCount(y, x, filter=None)
  204. Returns an ``int`` of the number of input rows in which both expressions
  205. are not null.
  206. .. note::
  207. The ``default`` argument is not supported.
  208. ``RegrIntercept``
  209. -----------------
  210. .. class:: RegrIntercept(y, x, filter=None, default=None)
  211. Returns the y-intercept of the least-squares-fit linear equation determined
  212. by the ``(x, y)`` pairs as a ``float``, or ``default`` if there aren't any
  213. matching rows.
  214. ``RegrR2``
  215. ----------
  216. .. class:: RegrR2(y, x, filter=None, default=None)
  217. Returns the square of the correlation coefficient as a ``float``, or
  218. ``default`` if there aren't any matching rows.
  219. ``RegrSlope``
  220. -------------
  221. .. class:: RegrSlope(y, x, filter=None, default=None)
  222. Returns the slope of the least-squares-fit linear equation determined
  223. by the ``(x, y)`` pairs as a ``float``, or ``default`` if there aren't any
  224. matching rows.
  225. ``RegrSXX``
  226. -----------
  227. .. class:: RegrSXX(y, x, filter=None, default=None)
  228. Returns ``sum(x^2) - sum(x)^2/N`` ("sum of squares" of the independent
  229. variable) as a ``float``, or ``default`` if there aren't any matching rows.
  230. ``RegrSXY``
  231. -----------
  232. .. class:: RegrSXY(y, x, filter=None, default=None)
  233. Returns ``sum(x*y) - sum(x) * sum(y)/N`` ("sum of products" of independent
  234. times dependent variable) as a ``float``, or ``default`` if there aren't
  235. any matching rows.
  236. ``RegrSYY``
  237. -----------
  238. .. class:: RegrSYY(y, x, filter=None, default=None)
  239. Returns ``sum(y^2) - sum(y)^2/N`` ("sum of squares" of the dependent
  240. variable) as a ``float``, or ``default`` if there aren't any matching rows.
  241. Usage examples
  242. ==============
  243. We will use this example table::
  244. | FIELD1 | FIELD2 | FIELD3 |
  245. |--------|--------|--------|
  246. | foo | 1 | 13 |
  247. | bar | 2 | (null) |
  248. | test | 3 | 13 |
  249. Here's some examples of some of the general-purpose aggregation functions::
  250. >>> TestModel.objects.aggregate(result=StringAgg('field1', delimiter=';'))
  251. {'result': 'foo;bar;test'}
  252. >>> TestModel.objects.aggregate(result=ArrayAgg('field2'))
  253. {'result': [1, 2, 3]}
  254. >>> TestModel.objects.aggregate(result=ArrayAgg('field1'))
  255. {'result': ['foo', 'bar', 'test']}
  256. The next example shows the usage of statistical aggregate functions. The
  257. underlying math will be not described (you can read about this, for example, at
  258. `wikipedia <https://en.wikipedia.org/wiki/Regression_analysis>`_)::
  259. >>> TestModel.objects.aggregate(count=RegrCount(y='field3', x='field2'))
  260. {'count': 2}
  261. >>> TestModel.objects.aggregate(avgx=RegrAvgX(y='field3', x='field2'),
  262. ... avgy=RegrAvgY(y='field3', x='field2'))
  263. {'avgx': 2, 'avgy': 13}