2005-10-01 00:02:51 +02:00
|
|
|
===========================
|
|
|
|
Outputting PDFs with Django
|
|
|
|
===========================
|
|
|
|
|
|
|
|
This document explains how to output PDF files dynamically using Django views.
|
|
|
|
This is made possible by the excellent, open-source ReportLab_ Python PDF
|
|
|
|
library.
|
|
|
|
|
|
|
|
The advantage of generating PDF files dynamically is that you can create
|
2005-10-04 17:17:22 +02:00
|
|
|
customized PDFs for different purposes -- say, for different users or different
|
2005-10-01 00:02:51 +02:00
|
|
|
pieces of content.
|
|
|
|
|
2006-02-18 17:45:36 +01:00
|
|
|
For example, Django was used at kusports.com_ to generate customized,
|
2005-10-01 00:02:51 +02:00
|
|
|
printer-friendly NCAA tournament brackets, as PDF files, for people
|
|
|
|
participating in a March Madness contest.
|
|
|
|
|
2009-12-26 07:37:26 +01:00
|
|
|
.. _ReportLab: http://www.reportlab.org/oss/rl-toolkit/
|
2006-02-18 17:45:36 +01:00
|
|
|
.. _kusports.com: http://www.kusports.com/
|
2005-10-01 00:02:51 +02:00
|
|
|
|
|
|
|
Install ReportLab
|
|
|
|
=================
|
|
|
|
|
2009-12-26 07:37:26 +01:00
|
|
|
Download and install the ReportLab library from http://www.reportlab.org/oss/rl-toolkit/download/.
|
2005-10-01 00:02:51 +02:00
|
|
|
The `user guide`_ (not coincidentally, a PDF file) explains how to install it.
|
|
|
|
|
2005-10-01 00:05:44 +02:00
|
|
|
Test your installation by importing it in the Python interactive interpreter::
|
2005-10-01 00:02:51 +02:00
|
|
|
|
2005-10-01 00:05:44 +02:00
|
|
|
>>> import reportlab
|
2005-10-01 00:02:51 +02:00
|
|
|
|
|
|
|
If that command doesn't raise any errors, the installation worked.
|
|
|
|
|
2009-02-22 07:06:29 +01:00
|
|
|
.. _user guide: http://www.reportlab.com/docs/reportlab-userguide.pdf
|
2005-10-01 00:02:51 +02:00
|
|
|
|
|
|
|
Write your view
|
|
|
|
===============
|
|
|
|
|
|
|
|
The key to generating PDFs dynamically with Django is that the ReportLab API
|
2008-08-24 00:25:40 +02:00
|
|
|
acts on file-like objects, and Django's :class:`~django.http.HttpResponse`
|
|
|
|
objects are file-like objects.
|
2005-10-01 00:02:51 +02:00
|
|
|
|
|
|
|
Here's a "Hello World" example::
|
|
|
|
|
|
|
|
from reportlab.pdfgen import canvas
|
2006-05-02 03:31:56 +02:00
|
|
|
from django.http import HttpResponse
|
2005-10-01 00:02:51 +02:00
|
|
|
|
|
|
|
def some_view(request):
|
|
|
|
# Create the HttpResponse object with the appropriate PDF headers.
|
|
|
|
response = HttpResponse(mimetype='application/pdf')
|
|
|
|
response['Content-Disposition'] = 'attachment; filename=somefilename.pdf'
|
|
|
|
|
|
|
|
# Create the PDF object, using the response object as its "file."
|
|
|
|
p = canvas.Canvas(response)
|
|
|
|
|
|
|
|
# Draw things on the PDF. Here's where the PDF generation happens.
|
|
|
|
# See the ReportLab documentation for the full list of functionality.
|
|
|
|
p.drawString(100, 100, "Hello world.")
|
|
|
|
|
|
|
|
# Close the PDF object cleanly, and we're done.
|
|
|
|
p.showPage()
|
|
|
|
p.save()
|
|
|
|
return response
|
|
|
|
|
|
|
|
The code and comments should be self-explanatory, but a few things deserve a
|
|
|
|
mention:
|
|
|
|
|
2008-08-24 00:25:40 +02:00
|
|
|
* The response gets a special MIME type, ``application/pdf``. This tells
|
2005-10-01 00:02:51 +02:00
|
|
|
browsers that the document is a PDF file, rather than an HTML file. If
|
|
|
|
you leave this off, browsers will probably interpret the output as HTML,
|
|
|
|
which would result in ugly, scary gobbledygook in the browser window.
|
|
|
|
|
|
|
|
* The response gets an additional ``Content-Disposition`` header, which
|
|
|
|
contains the name of the PDF file. This filename is arbitrary: Call it
|
|
|
|
whatever you want. It'll be used by browsers in the "Save as..."
|
|
|
|
dialogue, etc.
|
|
|
|
|
2006-02-18 17:43:17 +01:00
|
|
|
* The ``Content-Disposition`` header starts with ``'attachment; '`` in this
|
|
|
|
example. This forces Web browsers to pop-up a dialog box
|
|
|
|
prompting/confirming how to handle the document even if a default is set
|
|
|
|
on the machine. If you leave off ``'attachment;'``, browsers will handle
|
|
|
|
the PDF using whatever program/plugin they've been configured to use for
|
|
|
|
PDFs. Here's what that code would look like::
|
|
|
|
|
|
|
|
response['Content-Disposition'] = 'filename=somefilename.pdf'
|
|
|
|
|
2005-10-01 00:02:51 +02:00
|
|
|
* Hooking into the ReportLab API is easy: Just pass ``response`` as the
|
|
|
|
first argument to ``canvas.Canvas``. The ``Canvas`` class expects a
|
2008-08-24 00:25:40 +02:00
|
|
|
file-like object, and :class:`~django.http.HttpResponse` objects fit the
|
|
|
|
bill.
|
2005-10-01 00:02:51 +02:00
|
|
|
|
|
|
|
* Note that all subsequent PDF-generation methods are called on the PDF
|
|
|
|
object (in this case, ``p``) -- not on ``response``.
|
|
|
|
|
|
|
|
* Finally, it's important to call ``showPage()`` and ``save()`` on the PDF
|
|
|
|
file.
|
2006-02-18 17:43:17 +01:00
|
|
|
|
|
|
|
Complex PDFs
|
|
|
|
============
|
|
|
|
|
|
|
|
If you're creating a complex PDF document with ReportLab, consider using the
|
2008-08-24 00:25:40 +02:00
|
|
|
cStringIO_ library as a temporary holding place for your PDF file. The cStringIO
|
|
|
|
library provides a file-like object interface that is particularly efficient.
|
|
|
|
Here's the above "Hello World" example rewritten to use ``cStringIO``::
|
2006-02-18 17:43:17 +01:00
|
|
|
|
2010-10-09 12:21:55 +02:00
|
|
|
# Fall back to StringIO in environments where cStringIO is not available
|
|
|
|
try:
|
|
|
|
from cStringIO import StringIO
|
|
|
|
except ImportError:
|
|
|
|
from StringIO import StringIO
|
2006-02-18 17:43:17 +01:00
|
|
|
from reportlab.pdfgen import canvas
|
2006-07-08 16:14:28 +02:00
|
|
|
from django.http import HttpResponse
|
2006-02-18 17:43:17 +01:00
|
|
|
|
|
|
|
def some_view(request):
|
|
|
|
# Create the HttpResponse object with the appropriate PDF headers.
|
|
|
|
response = HttpResponse(mimetype='application/pdf')
|
|
|
|
response['Content-Disposition'] = 'attachment; filename=somefilename.pdf'
|
|
|
|
|
2006-03-24 03:28:47 +01:00
|
|
|
buffer = StringIO()
|
2006-02-18 17:43:17 +01:00
|
|
|
|
|
|
|
# Create the PDF object, using the StringIO object as its "file."
|
|
|
|
p = canvas.Canvas(buffer)
|
|
|
|
|
|
|
|
# Draw things on the PDF. Here's where the PDF generation happens.
|
|
|
|
# See the ReportLab documentation for the full list of functionality.
|
|
|
|
p.drawString(100, 100, "Hello world.")
|
|
|
|
|
|
|
|
# Close the PDF object cleanly.
|
|
|
|
p.showPage()
|
|
|
|
p.save()
|
|
|
|
|
|
|
|
# Get the value of the StringIO buffer and write it to the response.
|
|
|
|
pdf = buffer.getvalue()
|
|
|
|
buffer.close()
|
|
|
|
response.write(pdf)
|
|
|
|
return response
|
|
|
|
|
2008-11-01 20:02:09 +01:00
|
|
|
.. _cStringIO: http://docs.python.org/library/stringio.html#module-cStringIO
|
2006-02-18 17:43:17 +01:00
|
|
|
|
|
|
|
Further resources
|
|
|
|
=================
|
|
|
|
|
|
|
|
* PDFlib_ is another PDF-generation library that has Python bindings. To
|
|
|
|
use it with Django, just use the same concepts explained in this article.
|
2009-12-26 07:37:26 +01:00
|
|
|
* `Pisa XHTML2PDF`_ is yet another PDF-generation library. Pisa ships with
|
2009-02-22 07:06:29 +01:00
|
|
|
an example of how to integrate Pisa with Django.
|
2006-02-18 17:43:17 +01:00
|
|
|
* HTMLdoc_ is a command-line script that can convert HTML to PDF. It
|
|
|
|
doesn't have a Python interface, but you can escape out to the shell
|
|
|
|
using ``system`` or ``popen`` and retrieve the output in Python.
|
|
|
|
|
|
|
|
.. _PDFlib: http://www.pdflib.org/
|
2009-12-26 07:37:26 +01:00
|
|
|
.. _`Pisa XHTML2PDF`: http://www.xhtml2pdf.com/
|
2006-02-18 17:43:17 +01:00
|
|
|
.. _HTMLdoc: http://www.htmldoc.org/
|
2008-08-24 00:25:40 +02:00
|
|
|
|
|
|
|
Other formats
|
|
|
|
=============
|
|
|
|
|
|
|
|
Notice that there isn't a lot in these examples that's PDF-specific -- just the
|
|
|
|
bits using ``reportlab``. You can use a similar technique to generate any
|
|
|
|
arbitrary format that you can find a Python library for. Also see
|
2010-08-19 21:27:44 +02:00
|
|
|
:doc:`/howto/outputting-csv` for another example and some techniques you can use
|
2008-08-24 00:25:40 +02:00
|
|
|
when generated text-based formats.
|