outputting-csv.txt 6.8 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173
  1. ==========================
  2. Outputting CSV with Django
  3. ==========================
  4. This document explains how to output CSV (Comma Separated Values) dynamically
  5. using Django views. To do this, you can either use the Python CSV library or the
  6. Django template system.
  7. Using the Python CSV library
  8. ============================
  9. Python comes with a CSV library, :mod:`csv`. The key to using it with Django is
  10. that the :mod:`csv` module's CSV-creation capability acts on file-like objects,
  11. and Django's :class:`~django.http.HttpResponse` objects are file-like objects.
  12. Here's an example::
  13. import csv
  14. from django.http import HttpResponse
  15. def some_view(request):
  16. # Create the HttpResponse object with the appropriate CSV header.
  17. response = HttpResponse(content_type='text/csv')
  18. response['Content-Disposition'] = 'attachment; filename="somefilename.csv"'
  19. writer = csv.writer(response)
  20. writer.writerow(['First row', 'Foo', 'Bar', 'Baz'])
  21. writer.writerow(['Second row', 'A', 'B', 'C', '"Testing"', "Here's a quote"])
  22. return response
  23. The code and comments should be self-explanatory, but a few things deserve a
  24. mention:
  25. * The response gets a special MIME type, :mimetype:`text/csv`. This tells
  26. browsers that the document is a CSV file, rather than an HTML file. If
  27. you leave this off, browsers will probably interpret the output as HTML,
  28. which will result in ugly, scary gobbledygook in the browser window.
  29. * The response gets an additional ``Content-Disposition`` header, which
  30. contains the name of the CSV file. This filename is arbitrary; call it
  31. whatever you want. It'll be used by browsers in the "Save as..."
  32. dialogue, etc.
  33. * Hooking into the CSV-generation API is easy: Just pass ``response`` as the
  34. first argument to ``csv.writer``. The ``csv.writer`` function expects a
  35. file-like object, and :class:`~django.http.HttpResponse` objects fit the
  36. bill.
  37. * For each row in your CSV file, call ``writer.writerow``, passing it an
  38. iterable object such as a list or tuple.
  39. * The CSV module takes care of quoting for you, so you don't have to worry
  40. about escaping strings with quotes or commas in them. Just pass
  41. ``writerow()`` your raw strings, and it'll do the right thing.
  42. Handling Unicode
  43. ~~~~~~~~~~~~~~~~
  44. Python's :mod:`csv` module does not support Unicode input. Since Django uses
  45. Unicode internally this means strings read from sources such as
  46. :class:`~django.http.HttpRequest` are potentially problematic. There are a few
  47. options for handling this:
  48. * Manually encode all Unicode objects to a compatible encoding.
  49. * Use the ``UnicodeWriter`` class provided in the `csv module's examples
  50. section`_.
  51. * Use the `python-unicodecsv module`_, which aims to be a drop-in
  52. replacement for :mod:`csv` that gracefully handles Unicode.
  53. For more information, see the Python documentation of the :mod:`csv` module.
  54. .. _`csv module's examples section`: http://docs.python.org/library/csv.html#examples
  55. .. _`python-unicodecsv module`: https://github.com/jdunck/python-unicodecsv
  56. .. _streaming-csv-files:
  57. Streaming large CSV files
  58. ~~~~~~~~~~~~~~~~~~~~~~~~~
  59. When dealing with views that generate very large responses, you might want to
  60. consider using Django's :class:`~django.http.StreamingHttpResponse` instead.
  61. For example, by streaming a file that takes a long time to generate you can
  62. avoid a load balancer dropping a connection that might have otherwise timed out
  63. while the server was generating the response.
  64. In this example, we make full use of Python generators to efficiently handle
  65. the assembly and transmission of a large CSV file::
  66. import csv
  67. from django.utils.six.moves import range
  68. from django.http import StreamingHttpResponse
  69. class Echo(object):
  70. """An object that implements just the write method of the file-like
  71. interface.
  72. """
  73. def write(self, value):
  74. """Write the value by returning it, instead of storing in a buffer."""
  75. return value
  76. def some_streaming_csv_view(request):
  77. """A view that streams a large CSV file."""
  78. # Generate a sequence of rows. The range is based on the maximum number of
  79. # rows that can be handled by a single sheet in most spreadsheet
  80. # applications.
  81. rows = (["Row {0}".format(idx), str(idx)] for idx in range(65536))
  82. pseudo_buffer = Echo()
  83. writer = csv.writer(pseudo_buffer)
  84. response = StreamingHttpResponse((writer.writerow(row) for row in rows),
  85. content_type="text/csv")
  86. response['Content-Disposition'] = 'attachment; filename="somefilename.csv"'
  87. return response
  88. Using the template system
  89. =========================
  90. Alternatively, you can use the :doc:`Django template system </topics/templates>`
  91. to generate CSV. This is lower-level than using the convenient Python :mod:`csv`
  92. module, but the solution is presented here for completeness.
  93. The idea here is to pass a list of items to your template, and have the
  94. template output the commas in a :ttag:`for` loop.
  95. Here's an example, which generates the same CSV file as above::
  96. from django.http import HttpResponse
  97. from django.template import loader, Context
  98. def some_view(request):
  99. # Create the HttpResponse object with the appropriate CSV header.
  100. response = HttpResponse(content_type='text/csv')
  101. response['Content-Disposition'] = 'attachment; filename="somefilename.csv"'
  102. # The data is hard-coded here, but you could load it from a database or
  103. # some other source.
  104. csv_data = (
  105. ('First row', 'Foo', 'Bar', 'Baz'),
  106. ('Second row', 'A', 'B', 'C', '"Testing"', "Here's a quote"),
  107. )
  108. t = loader.get_template('my_template_name.txt')
  109. c = Context({
  110. 'data': csv_data,
  111. })
  112. response.write(t.render(c))
  113. return response
  114. The only difference between this example and the previous example is that this
  115. one uses template loading instead of the CSV module. The rest of the code --
  116. such as the ``content_type='text/csv'`` -- is the same.
  117. Then, create the template ``my_template_name.txt``, with this template code:
  118. .. code-block:: html+django
  119. {% for row in data %}"{{ row.0|addslashes }}", "{{ row.1|addslashes }}", "{{ row.2|addslashes }}", "{{ row.3|addslashes }}", "{{ row.4|addslashes }}"
  120. {% endfor %}
  121. This template is quite basic. It just iterates over the given data and displays
  122. a line of CSV for each row. It uses the :tfilter:`addslashes` template filter to
  123. ensure there aren't any problems with quotes.
  124. Other text-based formats
  125. ========================
  126. Notice that there isn't very much specific to CSV here -- just the specific
  127. output format. You can use either of these techniques to output any text-based
  128. format you can dream of. You can also use a similar technique to generate
  129. arbitrary binary data; see :doc:`/howto/outputting-pdf` for an example.