============================
Performance and optimization
============================

This document provides an overview of techniques and tools that can help get
your Django code running more efficiently - faster, and using fewer system
resources.

Introduction
============

Generally one's first concern is to write code that *works*, whose logic
functions as required to produce the expected output. Sometimes, however, this
will not be enough to make the code work as *efficiently* as one would like.

In this case, what's needed is something - and in practice, often a collection
of things - to improve the code's performance without, or only minimally,
affecting its behavior.

General approaches
==================

What are you optimizing *for*?
------------------------------

It's important to have a clear idea what you mean by 'performance'. There is
not just one metric of it.

Improved speed might be the most obvious aim for a program, but sometimes other
performance improvements might be sought, such as lower memory consumption or
fewer demands on the database or network.

Improvements in one area will often bring about improved performance in
another, but not always; sometimes one can even be at the expense of another.
For example, an improvement in a program's speed might cause it to use more
memory. Even worse, it can be self-defeating - if the speed improvement is so
memory-hungry that the system starts to run out of memory, you'll have done
more harm than good.

There are other trade-offs to bear in mind. Your own time is a valuable
resource, more precious than CPU time. Some improvements might be too difficult
to be worth implementing, or might affect the portability or maintainability of
the code. Not all performance improvements are worth the effort.

So, you need to know what performance improvements you are aiming for, and you
also need to know that you have a good reason for aiming in that direction -
and for that you need:

Performance benchmarking
------------------------

It's no good just guessing or assuming where the inefficiencies lie in your
code.

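
One way to replace guesswork with measurement is the standard library's
``timeit`` module. The following is a minimal sketch and not part of Django:
``build_with_concat``, ``build_with_join``, and ``benchmark`` are hypothetical
names chosen for illustration, and the two functions merely stand in for two
candidate implementations of the same task.

.. code-block:: python

    import timeit


    def build_with_concat(n):
        """Candidate A (hypothetical): repeated string concatenation."""
        result = ""
        for i in range(n):
            result += str(i)
        return result


    def build_with_join(n):
        """Candidate B (hypothetical): a single str.join() call."""
        return "".join(str(i) for i in range(n))


    def benchmark(func, *args, repeat=5, number=200):
        """Return the best observed time per call, in seconds."""
        timings = timeit.repeat(lambda: func(*args), repeat=repeat, number=number)
        # The minimum is the measurement least disturbed by other activity.
        return min(timings) / number


    if __name__ == "__main__":
        for candidate in (build_with_concat, build_with_join):
            seconds = benchmark(candidate, 1000)
            print(f"{candidate.__name__}: {seconds:.6f}s per call")

Which candidate wins, and by how much, depends on the interpreter and the input
size, so it's worth measuring with data that resembles what your site actually
handles.
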
Django tools
~~~~~~~~~~~~

`django-debug-toolbar
<https://github.com/jazzband/django-debug-toolbar/>`_ is a very handy tool that
provides insights into what your code is doing and how much time it spends
doing it. In particular it can show you all the SQL queries your page is
generating, and how long each one has taken.

Third-party panels are also available for the toolbar; these can, for example,
report on cache performance and template rendering times.

Third-party services
~~~~~~~~~~~~~~~~~~~~

There are a number of free services that will analyze and report on the
performance of your site's pages from the perspective of a remote HTTP client,
in effect simulating the experience of an actual user.

These can't report on the internals of your code, but can provide a useful
insight into your site's overall performance, including aspects that can't be
adequately measured from within the Django environment. Examples include:

* `Yahoo's Yslow <http://yslow.org/>`_
* `Google PageSpeed <https://developers.google.com/speed/>`_

There are also several paid-for services that perform a similar analysis,
including some that are Django-aware and can integrate with your codebase to
profile its performance far more comprehensively.

Get things right from the start
-------------------------------

Some work in optimization involves tackling performance shortcomings, but some
of the work can be built in to what you'd do anyway, as part of the good
practices you should adopt even before you start thinking about improving
performance.

In this respect Python is an excellent language to work with, because solutions
that look elegant and feel right are usually the best-performing ones. As with
most skills, learning what "looks right" takes practice, but one of the most
useful guidelines is:

Work at the appropriate level
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Django offers many different ways of approaching things, but just because it's
possible to do something in a certain way doesn't mean that it's the most
appropriate way to do it. For example, you might find that you could calculate
the same thing - the number of items in a collection, perhaps - in a
``QuerySet``, in Python, or in a template.

However, it will almost always be faster to do this work at lower rather than
higher levels. At higher levels the system has to deal with objects through
multiple levels of abstraction and layers of machinery.

That is, the database can typically do things faster than Python can, which can
do them faster than the template language can::

    # QuerySet operation on the database
    # fast, because that's what databases are good at
    my_bicycles.count()

    # counting Python objects
    # slower, because it requires a database query anyway, and processing
    # of the Python objects
    len(my_bicycles)

.. code-block:: html+django

    <!--
    Django template filter
    slower still, because it will have to count them in Python anyway,
    and because of template language overheads
    -->
    {{ my_bicycles|length }}

Generally speaking, the most appropriate level for the job is the lowest-level
one that it is comfortable to code for.

.. note::

    The example above is merely illustrative.

    Firstly, in a real-life case you need to consider what is happening before
    and after your count to work out what's an optimal way of doing it *in that
    particular context*. The database optimization documentation describes
    :ref:`a case where counting in the template would be better
    <overuse_of_count_and_exists>`.

    Secondly, there are other options to consider: in a real-life case,
    ``{{ my_bicycles.count }}``, which invokes the ``QuerySet`` ``count()``
    method directly from the template, might be the most appropriate choice.

Caching
=======

Often it is expensive (that is, resource-hungry and slow) to compute a value,
so there can be huge benefit in saving the value to a quickly accessible cache,
ready for the next time it's required.

It's a sufficiently significant and powerful technique that Django includes a
comprehensive caching framework, as well as other smaller pieces of caching
functionality.

:doc:`The caching framework </topics/cache>`
--------------------------------------------

Django's :doc:`caching framework </topics/cache>` offers very significant
opportunities for performance gains, by saving dynamic content so that it
doesn't need to be calculated for each request.

For convenience, Django offers different levels of cache granularity: you can
cache the output of specific views, or only the pieces that are difficult to
produce, or even an entire site.

Implementing caching should not be regarded as an alternative to improving code
that's performing poorly because it has been written badly. It's one of the
final steps toward producing well-performing code, not a shortcut.

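
Whatever granularity is chosen, a cache backend has to be configured first.
The fragment below is a minimal sketch of a ``settings.py`` entry using the
local-memory backend. The ``LOCATION`` name and the timeout are arbitrary
values chosen for illustration, and a production site would more likely point
at a shared cache service.

.. code-block:: python

    # settings.py (fragment)
    CACHES = {
        "default": {
            # Per-process, in-memory cache; convenient for development.
            "BACKEND": "django.core.cache.backends.locmem.LocMemCache",
            # Hypothetical name identifying this particular memory store.
            "LOCATION": "performance-example",
            # Default lifetime of a cached value, in seconds (illustrative).
            "TIMEOUT": 300,
        }
    }
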
:class:`~django.utils.functional.cached_property`
-------------------------------------------------

It's common to have to call a class instance's method more than once. If
that method is expensive, then doing so can be wasteful.

Using the :class:`~django.utils.functional.cached_property` decorator saves the
value returned by a property; the next time the property is accessed on that
instance, it will return the saved value rather than re-computing it. Note that
this only works on methods that take ``self`` as their only argument and that
it changes the method to a property.

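
A minimal sketch of the effect, assuming a hypothetical ``Report`` class. The
fallback to the standard library's ``functools.cached_property``, which behaves
similarly for this purpose, is only there so that the snippet also runs where
Django isn't installed.

.. code-block:: python

    try:
        from django.utils.functional import cached_property
    except ImportError:
        # Standard-library analogue, used only if Django is unavailable.
        from functools import cached_property


    class Report:
        """Hypothetical class with one expensive computed value."""

        def __init__(self, rows):
            self.rows = rows
            self.computations = 0  # counts how often the work is really done

        @cached_property
        def total(self):
            self.computations += 1
            return sum(self.rows)


    report = Report([1, 2, 3])
    first = report.total   # computed on first access
    second = report.total  # served from the value saved on the instance

After both accesses, ``report.computations`` is still ``1``. Note that
``total`` is read as an attribute, without parentheses.
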
Certain Django components also have their own caching functionality; these are
discussed below in the sections related to those components.

Understanding laziness
======================

*Laziness* is a strategy complementary to caching. Caching avoids
recomputation by saving results; laziness delays computation until it's
actually required.

Laziness allows us to refer to things before they are instantiated, or even
before it's possible to instantiate them. This has numerous uses.

For example, :ref:`lazy translation <lazy-translations>` can be used before the
target language is even known, because it doesn't take place until the
translated string is actually required, such as in a rendered template.

Laziness is also a way to save effort by trying to avoid work in the first
place. That is, one aspect of laziness is not doing anything until it has to be
done, because it may not turn out to be necessary after all. Laziness can
therefore have performance implications, and the more expensive the work
concerned, the more there is to gain through laziness.

Python provides a number of tools for lazy evaluation, particularly through the
:py:term:`generator` and :py:term:`generator expression` constructs. It's worth
reading up on laziness in Python to discover opportunities for making use of
lazy patterns in your code.

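
As a small illustration in plain Python, the sketch below contrasts a list
comprehension, which does all of its work immediately, with a generator
expression, which does none until items are requested. The ``expensive``
function and the ``calls`` list are hypothetical stand-ins for costly work and
a way of observing when it happens.

.. code-block:: python

    import itertools

    calls = []  # records every time the "expensive" work is really done


    def expensive(n):
        """Hypothetical stand-in for a costly computation."""
        calls.append(n)
        return n * n


    # Eager: every item is computed as soon as the list is built.
    eager = [expensive(n) for n in range(5)]
    calls_after_eager = len(calls)  # 5

    calls.clear()

    # Lazy: creating the generator computes nothing at all.
    lazy = (expensive(n) for n in range(1_000_000))
    calls_after_creation = len(calls)  # 0

    # Only the items actually consumed are computed.
    first_three = list(itertools.islice(lazy, 3))  # [0, 1, 4]
    calls_after_slice = len(calls)  # 3
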
Laziness in Django
------------------

Django is itself quite lazy. A good example of this can be found in the
evaluation of ``QuerySets``. :ref:`QuerySets are lazy <querysets-are-lazy>`.
Thus a ``QuerySet`` can be created, passed around and combined with other
``QuerySets``, without actually incurring any trips to the database to fetch
the items it describes. What gets passed around is the ``QuerySet`` object, not
the collection of items that - eventually - will be required from the database.

On the other hand, :ref:`certain operations will force the evaluation of a
QuerySet <when-querysets-are-evaluated>`. Avoiding the premature evaluation of
a ``QuerySet`` can save making an expensive and unnecessary trip to the
database.

Django also offers a :func:`~django.utils.functional.keep_lazy` decorator.
This allows a function that has been called with a lazy argument to behave
lazily itself, only being evaluated when it needs to be. Thus the lazy argument
- which could be an expensive one - will not be called upon for evaluation
until it's strictly required.

Databases
=========

Database optimization
---------------------

Django's database layer provides various ways to help developers get the best
performance from their databases. The :doc:`database optimization documentation
</topics/db/optimization>` gathers together links to the relevant
documentation and adds various tips that outline the steps to take when
attempting to optimize your database usage.

Other database-related tips
---------------------------

Enabling :ref:`persistent-database-connections` can speed things up when
connecting to the database accounts for a significant part of the request
processing time.

This helps a lot on virtualized hosts with limited network performance, for
example.

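
Persistent connections are controlled by the ``CONN_MAX_AGE`` key of each
``DATABASES`` entry. The following is a sketch of a settings fragment in which
the engine, the database name, and the 60-second lifetime are placeholder
values chosen for illustration.

.. code-block:: python

    # settings.py (fragment)
    DATABASES = {
        "default": {
            "ENGINE": "django.db.backends.postgresql",  # placeholder engine
            "NAME": "mydatabase",                       # hypothetical name
            # Keep each connection open for up to 60 seconds instead of
            # closing it at the end of every request (the default is 0).
            "CONN_MAX_AGE": 60,
        }
    }

A suitable lifetime depends on how your database server and hosting handle
idle connections, so treat the number as something to measure rather than
copy.
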
HTTP performance
================

Middleware
----------

Django comes with a few helpful pieces of :doc:`middleware </ref/middleware>`
that can help optimize your site's performance. They include:

:class:`~django.middleware.http.ConditionalGetMiddleware`
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Adds support for modern browsers to conditionally GET responses based on the
``ETag`` and ``Last-Modified`` headers. It also calculates and sets an ETag if
needed.

:class:`~django.middleware.gzip.GZipMiddleware`
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Compresses responses for all modern browsers, saving bandwidth and transfer
time. Note that ``GZipMiddleware`` is currently considered a security risk, and
is vulnerable to attacks that nullify the protection provided by TLS/SSL. See
the warning in :class:`~django.middleware.gzip.GZipMiddleware` for more
information.

Sessions
--------

Using cached sessions
~~~~~~~~~~~~~~~~~~~~~

:ref:`Using cached sessions <cached-sessions-backend>` may be a way to increase
performance by eliminating the need to load session data from a slower storage
source like the database and instead storing frequently used session data in
memory.

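
A sketch of the corresponding settings fragment, assuming a cache backend has
already been configured in ``CACHES``. Which of the two engines is appropriate
depends on whether session data may be lost when the cache is cleared.

.. code-block:: python

    # settings.py (fragment)

    # Store session data in the cache only: fastest, but sessions disappear
    # if the cache is flushed or evicts the entry.
    SESSION_ENGINE = "django.contrib.sessions.backends.cache"

    # Alternative: a write-through cache backed by the database, so session
    # data survives a cache restart.
    # SESSION_ENGINE = "django.contrib.sessions.backends.cached_db"
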
Static files
------------

Static files, which by definition are not dynamic, make an excellent target for
optimization gains.

:class:`~django.contrib.staticfiles.storage.ManifestStaticFilesStorage`
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

By taking advantage of web browsers' caching abilities, you can
eliminate network hits entirely for a given file after the initial download.

:class:`~django.contrib.staticfiles.storage.ManifestStaticFilesStorage` appends a
content-dependent tag to the filenames of :doc:`static files
</ref/contrib/staticfiles>` to make it safe for browsers to cache them
long-term without missing future changes - when a file changes, so will the
tag, so browsers will reload the asset automatically.

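
A sketch of how this storage might be selected, assuming Django 4.2 or later,
where storage backends are configured through the ``STORAGES`` setting (earlier
versions used the ``STATICFILES_STORAGE`` setting instead). The ``default``
entry is included because defining ``STORAGES`` replaces the built-in default
as a whole.

.. code-block:: python

    # settings.py (fragment, assumes Django 4.2+)
    STORAGES = {
        # Storage for uploaded files; unchanged from Django's default.
        "default": {
            "BACKEND": "django.core.files.storage.FileSystemStorage",
        },
        # Storage used by collectstatic; adds the content-dependent tag.
        "staticfiles": {
            "BACKEND": (
                "django.contrib.staticfiles.storage.ManifestStaticFilesStorage"
            ),
        },
    }
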
"Minification"
~~~~~~~~~~~~~~

Several third-party Django tools and packages provide the ability to "minify"
HTML, CSS, and JavaScript. They remove unnecessary whitespace, newlines, and
comments, and shorten variable names, and thus reduce the size of the documents
that your site publishes.

Template performance
====================

Note that:

* using ``{% block %}`` is faster than using ``{% include %}``
* heavily-fragmented templates, assembled from many small pieces, can affect
  performance

The cached template loader
--------------------------

Enabling the :class:`cached template loader
<django.template.loaders.cached.Loader>` often improves performance
drastically, as it avoids compiling each template every time it needs to be
rendered.

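
A sketch of an explicit configuration, in which the cached loader wraps the
two standard loaders. Recent Django versions enable this loader automatically
when ``OPTIONS['loaders']`` isn't specified, so a fragment like this matters
mainly if you customize ``loaders`` yourself. ``APP_DIRS`` is left out because
it can't be combined with an explicit ``loaders`` option.

.. code-block:: python

    # settings.py (fragment)
    TEMPLATES = [
        {
            "BACKEND": "django.template.backends.django.DjangoTemplates",
            "DIRS": [],
            "OPTIONS": {
                "loaders": [
                    (
                        # The cached loader keeps compiled templates in memory
                        # and delegates lookups to the loaders it wraps.
                        "django.template.loaders.cached.Loader",
                        [
                            "django.template.loaders.filesystem.Loader",
                            "django.template.loaders.app_directories.Loader",
                        ],
                    ),
                ],
            },
        },
    ]
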
Using different versions of available software
==============================================

It can sometimes be worth checking whether different and better-performing
versions of the software that you're using are available.

These techniques are targeted at more advanced users who want to push the
boundaries of performance of an already well-optimized Django site.

However, they are not magic solutions to performance problems, and they're
unlikely to bring better than marginal gains to sites that don't already do the
more basic things the right way.

.. note::

    It's worth repeating: **reaching for alternatives to software you're
    already using is never the first answer to performance problems**. When
    you reach this level of optimization, you need a formal benchmarking
    solution.

Newer is often - but not always - better
----------------------------------------

It's fairly rare for a new release of well-maintained software to be less
efficient, but the maintainers can't anticipate every possible use-case - so
while being aware that newer versions are likely to perform better, don't
assume that they always will.

This is true of Django itself. Successive releases have offered a number of
improvements across the system, but you should still check the real-world
performance of your application, because in some cases you may find that
changes mean it performs worse rather than better.

Newer versions of Python, and also of Python packages, will often perform
better too - but measure, rather than assume.

.. note::

    Unless you've encountered an unusual performance problem in a particular
    version, you'll generally find better features, reliability, and security
    in a new release and that these benefits are far more significant than any
    performance you might win or lose.

Alternatives to Django's template language
------------------------------------------

For nearly all cases, Django's built-in template language is perfectly
adequate. However, if the bottlenecks in your Django project seem to lie in the
template system and you have exhausted other opportunities to remedy this, a
third-party alternative may be the answer.

Jinja2_ can offer performance improvements, particularly when it comes to
speed.

Alternative template systems vary in the extent to which they share Django's
templating language.

.. note::

    *If* you experience performance issues in templates, the first thing to do
    is to understand exactly why. Using an alternative template system may
    prove faster, but the same gains may also be available without going to
    that trouble - for example, expensive processing and logic in your
    templates could be done more efficiently in your views.

Alternative software implementations
------------------------------------

It may be worth checking whether Python software you're using has been
provided in a different implementation that can execute the same code faster.

However: most performance problems in well-written Django sites aren't at the
Python execution level, but rather in inefficient database querying, caching,
and templates. If you're relying on poorly-written Python code, your
performance problems are unlikely to be solved by having it execute faster.

Using an alternative implementation may introduce compatibility, deployment,
portability, or maintenance issues. It goes without saying that before adopting
a non-standard implementation you should ensure it provides sufficient
performance gains for your application to outweigh the potential risks.

With these caveats in mind, you should be aware of:

`PyPy <https://www.pypy.org/>`_
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

`PyPy <https://www.pypy.org/>`_ is an implementation of Python in Python itself
(the 'standard' Python implementation is in C). PyPy can offer substantial
performance gains, typically for heavyweight applications.

A key aim of the PyPy project is `compatibility
<https://www.pypy.org/compat.html>`_ with existing Python APIs and libraries.
Django is compatible, but you will need to check the compatibility of other
libraries you rely on.

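C implementations of Python libraries
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Some Python libraries are also implemented in C, and can be much faster. They
aim to offer the same APIs. Note that compatibility issues and behavior
differences are not unknown (and not always immediately evident).

.. _Jinja2: https://jinja.palletsprojects.com/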
  319. aim to offer the same APIs. Note that compatibility issues and behavior
  320. differences are not unknown (and not always immediately evident).
  321. .. _Jinja2: https://jinja.palletsprojects.com/