Module functions and constants¶
The pg
module defines a few functions that allow to connect
to a database and to define “default variables” that override
the environment variables used by PostgreSQL.
These “default variables” were designed to allow you to handle general connection parameters without heavy code in your programs. You can prompt the user for a value, put it in the default variable, and forget it, without having to modify your environment.
All variables are set to None
at module initialization, specifying that
standard environment variables should be used.
connect – Open a PostgreSQL connection¶
- pg.connect([dbname][, host][, port][, opt][, user][, passwd][, nowait])¶
Open a
pg
connection- Parameters:
dbname – name of connected database (None =
defbase
)host (str or None) – name of the server host (None =
defhost
)port (int) – port used by the database server (-1 =
defport
)opt (str or None) – connection options (None =
defopt
)user (str or None) – PostgreSQL user (None =
defuser
)passwd (str or None) – password for user (None =
defpasswd
)nowait (bool) – whether the connection should happen asynchronously
- Returns:
If successful, the
Connection
handling the connection- Return type:
- Raises:
TypeError – bad argument type, or too many arguments
SyntaxError – duplicate argument definition
pg.InternalError – some error occurred during pg connection definition
Exception – (all exceptions relative to object allocation)
This function opens a connection to a specified database on a given
PostgreSQL server. You can use keywords here, as described in the
Python tutorial. The names of the keywords are the name of the
parameters given in the syntax line. The opt
parameter can be used
to pass command-line options to the server. For a precise description
of the parameters, please refer to the PostgreSQL user manual.
See Connection.poll()
for a description of the nowait
parameter.
If you want to add additional parameters not specified here, you must
pass a connection string or a connection URI instead of the dbname
(as in con3
and con4
in the following example).
Changed in version 5.2: Support for asynchronous connections via the nowait
parameter.
Example:
import pg
con1 = pg.connect('testdb', 'myhost', 5432, None, 'bob', None)
con2 = pg.connect(dbname='testdb', host='myhost', user='bob')
con3 = pg.connect('host=myhost user=bob dbname=testdb connect_timeout=10')
con4 = pg.connect('postgresql://bob@myhost/testdb?connect_timeout=10')
get_pqlib_version – get the version of libpq¶
- pg.get_pqlib_version()¶
Get the version of libpq that is being used by PyGreSQL
- Returns:
the version of libpq
- Return type:
int
- Raises:
TypeError – too many arguments
The number is formed by converting the major, minor, and revision numbers of the libpq version into two-decimal-digit numbers and appending them together. For example, version 15.4 will be returned as 150400.
Added in version 5.2.
get/set_defhost – default server host¶
- pg.get_defhost(host)¶
Get the default host
- Returns:
the current default host specification
- Return type:
str or None
- Raises:
TypeError – too many arguments
This method returns the current default host specification,
or None
if the environment variables should be used.
Environment variables won’t be looked up.
- pg.set_defhost(host)¶
Set the default host
- Parameters:
host (str or None) – the new default host specification
- Returns:
the previous default host specification
- Return type:
str or None
- Raises:
TypeError – bad argument type, or too many arguments
This methods sets the default host value for new connections.
If None
is supplied as parameter, environment variables will
be used in future connections. It returns the previous setting
for default host.
get/set_defport – default server port¶
- pg.get_defport()¶
Get the default port
- Returns:
the current default port specification
- Return type:
int
- Raises:
TypeError – too many arguments
This method returns the current default port specification,
or None
if the environment variables should be used.
Environment variables won’t be looked up.
- pg.set_defport(port)¶
Set the default port
- Parameters:
port (int) – the new default port
- Returns:
previous default port specification
- Return type:
int or None
This methods sets the default port value for new connections. If -1 is supplied as parameter, environment variables will be used in future connections. It returns the previous setting for default port.
get/set_defopt – default connection options¶
- pg.get_defopt()¶
Get the default connection options
- Returns:
the current default options specification
- Return type:
str or None
- Raises:
TypeError – too many arguments
This method returns the current default connection options specification,
or None
if the environment variables should be used. Environment variables
won’t be looked up.
- pg.set_defopt(options)¶
Set the default connection options
- Parameters:
options (str or None) – the new default connection options
- Returns:
previous default options specification
- Return type:
str or None
- Raises:
TypeError – bad argument type, or too many arguments
This methods sets the default connection options value for new connections.
If None
is supplied as parameter, environment variables will be used in
future connections. It returns the previous setting for default options.
get/set_defbase – default database name¶
- pg.get_defbase()¶
Get the default database name
- Returns:
the current default database name specification
- Return type:
str or None
- Raises:
TypeError – too many arguments
This method returns the current default database name specification, or
None
if the environment variables should be used. Environment variables
won’t be looked up.
- pg.set_defbase(base)¶
Set the default database name
- Parameters:
base (str or None) – the new default base name
- Returns:
the previous default database name specification
- Return type:
str or None
- Raises:
TypeError – bad argument type, or too many arguments
This method sets the default database name value for new connections. If
None
is supplied as parameter, environment variables will be used in
future connections. It returns the previous setting for default host.
get/set_defuser – default database user¶
- pg.get_defuser()¶
Get the default database user
- Returns:
the current default database user specification
- Return type:
str or None
- Raises:
TypeError – too many arguments
This method returns the current default database user specification, or
None
if the environment variables should be used. Environment variables
won’t be looked up.
- pg.set_defuser(user)¶
Set the default database user
- Parameters:
user – the new default database user
- Returns:
the previous default database user specification
- Return type:
str or None
- Raises:
TypeError – bad argument type, or too many arguments
This method sets the default database user name for new connections. If
None
is supplied as parameter, environment variables will be used in
future connections. It returns the previous setting for default host.
get/set_defpasswd – default database password¶
- pg.get_defpasswd()¶
Get the default database password
- Returns:
the current default database password specification
- Return type:
str or None
- Raises:
TypeError – too many arguments
This method returns the current default database password specification, or
None
if the environment variables should be used. Environment variables
won’t be looked up.
- pg.set_defpasswd(passwd)¶
Set the default database password
- Parameters:
passwd – the new default database password
- Returns:
the previous default database password specification
- Return type:
str or None
- Raises:
TypeError – bad argument type, or too many arguments
This method sets the default database password for new connections. If
None
is supplied as parameter, environment variables will be used in
future connections. It returns the previous setting for default host.
escape_string – escape a string for use within SQL¶
- pg.escape_string(string)¶
Escape a string for use within SQL
- Parameters:
string (str) – the string that is to be escaped
- Returns:
the escaped string
- Return type:
str
- Raises:
TypeError – bad argument type, or too many arguments
This function escapes a string for use within an SQL command.
This is useful when inserting data values as literal constants
in SQL commands. Certain characters (such as quotes and backslashes)
must be escaped to prevent them from being interpreted specially
by the SQL parser. escape_string()
performs this operation.
Note that there is also a Connection
method with the same name
which takes connection properties into account.
Note
It is especially important to do proper escaping when handling strings that were received from an untrustworthy source. Otherwise there is a security risk: you are vulnerable to “SQL injection” attacks wherein unwanted SQL commands are fed to your database.
Example:
name = input("Name? ")
phone = con.query("select phone from employees"
f" where name='{escape_string(name)}'").singlescalar()
escape_bytea – escape binary data for use within SQL¶
- pg.escape_bytea(datastring)¶
escape binary data for use within SQL as type
bytea
- Parameters:
datastring (bytes/str) – the binary data that is to be escaped
- Returns:
the escaped string
- Return type:
bytes/str
- Raises:
TypeError – bad argument type, or too many arguments
Escapes binary data for use within an SQL command with the type bytea
.
The return value will have the same type as the given datastring.
As with escape_string()
, this is only used when inserting data directly
into an SQL command string.
Note that there is also a Connection
method with the same name
which takes connection properties into account.
Example:
picture = open('garfield.gif', 'rb').read()
con.query(f"update pictures set img='{escape_bytea(picture)}'"
" where name='Garfield'")
unescape_bytea – unescape data that has been retrieved as text¶
- pg.unescape_bytea(string)¶
Unescape
bytea
data that has been retrieved as text- Parameters:
string (str) – the
bytea
string that has been retrieved as text- Returns:
byte string containing the binary data
- Return type:
bytes
- Raises:
TypeError – bad argument type, or too many arguments
Converts an escaped string representation of binary data stored as bytea
into the raw byte string representing the binary data – this is the reverse
of escape_bytea()
. Since the Query
results will already
return unescaped byte strings, you normally don’t have to use this method.
Note that there is also a DB
method with the same name
which does exactly the same.
get/set_decimal – decimal type to be used for numeric values¶
- pg.get_decimal()¶
Get the decimal type to be used for numeric values
- Returns:
the Python class used for PostgreSQL numeric values
- Return type:
class
This function returns the Python class that is used by PyGreSQL to hold
PostgreSQL numeric values. The default class is decimal.Decimal
.
- pg.set_decimal(cls)¶
Set a decimal type to be used for numeric values
- Parameters:
cls (class) – the Python class to be used for PostgreSQL numeric values
This function can be used to specify the Python class that shall
be used by PyGreSQL to hold PostgreSQL numeric values.
The default class is decimal.Decimal
.
get/set_decimal_point – decimal mark used for monetary values¶
- pg.get_decimal_point()¶
Get the decimal mark used for monetary values
- Returns:
string with one character representing the decimal mark
- Return type:
str
This function returns the decimal mark used by PyGreSQL to interpret
PostgreSQL monetary values when converting them to decimal numbers.
The default setting is '.'
as a decimal point. This setting is not
adapted automatically to the locale used by PostgreSQL, but you can use
set_decimal()
to set a different decimal mark manually. A return
value of None
means monetary values are not interpreted as decimal
numbers, but returned as strings including the formatting and currency.
Added in version 4.1.1.
- pg.set_decimal_point(string)¶
Specify which decimal mark is used for interpreting monetary values
- Parameters:
string (str) – string with one character representing the decimal mark
This function can be used to specify the decimal mark used by PyGreSQL
to interpret PostgreSQL monetary values. The default value is ‘.’ as
a decimal point. This value is not adapted automatically to the locale
used by PostgreSQL, so if you are dealing with a database set to a
locale that uses a ','
instead of '.'
as the decimal point,
then you need to call set_decimal(',')
to have PyGreSQL interpret
monetary values correctly. If you don’t want money values to be converted
to decimal numbers, then you can call set_decimal(None)
, which will
cause PyGreSQL to return monetary values as strings including their
formatting and currency.
Added in version 4.1.1.
get/set_bool – whether boolean values are returned as bool objects¶
- pg.get_bool()¶
Check whether boolean values are returned as bool objects
- Returns:
whether or not bool objects will be returned
- Return type:
bool
This function checks whether PyGreSQL returns PostgreSQL boolean
values converted to Python bool objects, or as 'f'
and 't'
strings which are the values used internally by PostgreSQL. By default,
conversion to bool objects is activated, but you can disable this with
the set_bool()
function.
Added in version 4.2.
- pg.set_bool(on)¶
Set whether boolean values are returned as bool objects
- Parameters:
on – whether or not bool objects shall be returned
This function can be used to specify whether PyGreSQL shall return
PostgreSQL boolean values converted to Python bool objects, or as
'f'
and 't'
strings which are the values used internally by
PostgreSQL. By default, conversion to bool objects is activated,
but you can disable this by calling set_bool(True)
.
Added in version 4.2.
Changed in version 5.0: Boolean values had been returned as string by default in earlier versions.
get/set_array – whether arrays are returned as list objects¶
- pg.get_array()¶
Check whether arrays are returned as list objects
- Returns:
whether or not list objects will be returned
- Return type:
bool
This function checks whether PyGreSQL returns PostgreSQL arrays converted
to Python list objects, or simply as text in the internal special output
syntax of PostgreSQL. By default, conversion to list objects is activated,
but you can disable this with the set_array()
function.
Added in version 5.0.
- pg.set_array(on)¶
Set whether arrays are returned as list objects
- Parameters:
on – whether or not list objects shall be returned
This function can be used to specify whether PyGreSQL shall return PostgreSQL
arrays converted to Python list objects, or simply as text in the internal
special output syntax of PostgreSQL. By default, conversion to list objects
is activated, but you can disable this by calling set_array(False)
.
Added in version 5.0.
Changed in version 5.0: Arrays had been always returned as text strings in earlier versions.
get/set_bytea_escaped – whether bytea data is returned escaped¶
- pg.get_bytea_escaped()¶
Check whether bytea values are returned as escaped strings
- Returns:
whether or not bytea objects will be returned escaped
- Return type:
bool
This function checks whether PyGreSQL returns PostgreSQL bytea
values in
escaped form or in unescaped from as byte strings. By default, bytea values
will be returned unescaped as byte strings, but you can change this with the
set_bytea_escaped()
function.
Added in version 5.0.
- pg.set_bytea_escaped(on)¶
Set whether bytea values are returned as escaped strings
- Parameters:
on – whether or not bytea objects shall be returned escaped
This function can be used to specify whether PyGreSQL shall return
PostgreSQL bytea
values in escaped form or in unescaped from as byte
strings. By default, bytea values will be returned unescaped as byte
strings, but you can change this by calling set_bytea_escaped(True)
.
Added in version 5.0.
Changed in version 5.0: Bytea data had been returned in escaped form by default in earlier versions.
get/set_jsondecode – decoding JSON format¶
- pg.get_jsondecode()¶
Get the function that deserializes JSON formatted strings
This returns the function used by PyGreSQL to construct Python objects from JSON formatted strings.
- pg.set_jsondecode(func)¶
Set a function that will deserialize JSON formatted strings
- Parameters:
func – the function to be used for deserializing JSON strings
You can use this if you do not want to deserialize JSON strings coming
in from the database, or if want to use a different function than the
standard function json.loads()
or if you want to use it with parameters
different from the default ones. If you set this function to None, then
the automatic deserialization of JSON strings will be deactivated.
Added in version 5.0.
Changed in version 5.0: JSON data had been always returned as text strings in earlier versions.
get/set_datestyle – assume a fixed date style¶
- pg.get_datestyle()¶
Get the assumed date style for typecasting
This returns the PostgreSQL date style that is silently assumed when
typecasting dates or None if no fixed date style is assumed, in which case
the date style is requested from the database when necessary (this is the
default). Note that this method will not get the date style that is
currently set in the session or in the database. You can get the current
setting with the methods DB.get_parameter()
and
Connection.parameter()
. You can also get the date format corresponding
to the current date style by calling Connection.date_format()
.
Added in version 5.0.
- pg.set_datestyle(datestyle)¶
Set a fixed date style that shall be assumed when typecasting
- Parameters:
datestyle (str) – the date style that shall be assumed, or None if no fixed dat style shall be assumed
PyGreSQL is able to automatically pick up the right date style for typecasting
date values from the database, even if you change it for the current session
with a SET DateStyle
command. This is happens very effectively without
an additional database request being involved. If you still want to have
PyGreSQL always assume a fixed date style instead, then you can set one with
this function. Note that calling this function will not alter the date
style of the database or the current session. You can do that by calling
the method DB.set_parameter()
instead.
Added in version 5.0.
get/set_typecast – custom typecasting¶
PyGreSQL uses typecast functions to cast the raw data coming from the database to Python objects suitable for the particular database type. These functions take a single string argument that represents the data to be casted and must return the casted value.
PyGreSQL provides through its C extension module basic typecast functions for the common database types, but if you want to add more typecast functions, you can set these using the following functions.
- pg.get_typecast(typ)¶
Get the global cast function for the given database type
- Parameters:
typ (str) – PostgreSQL type name
- Returns:
the typecast function for the specified type
- Return type:
function or None
Added in version 5.0.
- pg.set_typecast(typ, cast)¶
Set a global typecast function for the given database type(s)
- Parameters:
typ (str or int) – PostgreSQL type name or list of type names
cast – the typecast function to be set for the specified type(s)
The typecast function must take one string object as argument and return a Python object into which the PostgreSQL type shall be casted. If the function takes another parameter named connection, then the current database connection will also be passed to the typecast function. This may sometimes be necessary to look up certain database settings.
Added in version 5.0.
Note that database connections cache types and their cast functions using
connection specific DbTypes
objects. You can also get, set and
reset typecast functions on the connection level using the methods
DbTypes.get_typecast()
, DbTypes.set_typecast()
and
DbTypes.reset_typecast()
of the DB.dbtypes
object. This will
not affect other connections or future connections. In order to be sure
a global change is picked up by a running connection, you must reopen it or
call DbTypes.reset_typecast()
on the DB.dbtypes
object.
Also note that the typecasting for all of the basic types happens already in the C extension module. The typecast functions that can be set with the above methods are only called for the types that are not already supported by the C extension module.
cast_array/record – fast parsers for arrays and records¶
PostgreSQL returns arrays and records (composite types) using a special output syntax with several quirks that cannot easily and quickly be parsed in Python. Therefore the C extension module provides two fast parsers that allow quickly turning these text representations into Python objects: Arrays will be converted to Python lists, and records to Python tuples. These fast parsers are used automatically by PyGreSQL in order to return arrays and records from database queries as lists and tuples, so you normally don’t need to call them directly. You may only need them for typecasting arrays of data types that are not supported by default in PostgreSQL.
- pg.cast_array(string[, cast][, delim])¶
Cast a string representing a PostgreSQL array to a Python list
- Parameters:
string (str) – the string with the text representation of the array
cast (callable or None) – a typecast function for the elements of the array
delim (bytes) – delimiter character between adjacent elements
- Returns:
a list representing the PostgreSQL array in Python
- Return type:
list
- Raises:
TypeError – invalid argument types
ValueError – error in the syntax of the given array
This function takes a string containing the text representation of a
PostgreSQL array (which may look like '{{1,2}{3,4}}'
for a two-dimensional
array), a typecast function cast that is called for every element, and
an optional delimiter character delim (usually a comma), and returns a
Python list representing the array (which may be nested like
[[1, 2], [3, 4]]
in this example). The cast function must take a single
argument which will be the text representation of the element and must output
the corresponding Python object that shall be put into the list. If you don’t
pass a cast function or set it to None, then unprocessed text strings will
be returned as elements of the array. If you don’t pass a delimiter character,
then a comma will be used by default.
Added in version 5.0.
- pg.cast_record(string[, cast][, delim])¶
Cast a string representing a PostgreSQL record to a Python tuple
- Parameters:
string (str) – the string with the text representation of the record
cast (callable, list or tuple of callables, or None) – typecast function(s) for the elements of the record
delim (bytes) – delimiter character between adjacent elements
- Returns:
a tuple representing the PostgreSQL record in Python
- Return type:
tuple
- Raises:
TypeError – invalid argument types
ValueError – error in the syntax of the given array
This function takes a string containing the text representation of a
PostgreSQL record (which may look like '(1,a,2,b)'
for a record composed
of four fields), a typecast function cast that is called for every element,
or a list or tuple of such functions corresponding to the individual fields
of the record, and an optional delimiter character delim (usually a comma),
and returns a Python tuple representing the record (which may be inhomogeneous
like (1, 'a', 2, 'b')
in this example). The cast function(s) must take a
single argument which will be the text representation of the element and must
output the corresponding Python object that shall be put into the tuple. If
you don’t pass cast function(s) or pass None instead, then unprocessed text
strings will be returned as elements of the tuple. If you don’t pass a
delimiter character, then a comma will be used by default.
Added in version 5.0.
Note that besides using parentheses instead of braces, there are other subtle differences in escaping special characters and NULL values between the syntax used for arrays and the one used for composite types, which these functions take into account.
Type helpers¶
The module provides the following type helper functions. You can wrap
parameters with these functions when passing them to DB.query()
or DB.query_formatted()
in order to give PyGreSQL a hint about the
type of the parameters, if it cannot be derived from the context.
- pg.Bytea(bytes)¶
A wrapper for holding a bytea value
Added in version 5.0.
- pg.HStore(dict)¶
A wrapper for holding an hstore dictionary
Added in version 5.0.
- pg.Json(obj)¶
A wrapper for holding an object serializable to JSON
Added in version 5.0.
The following additional type helper is only meaningful when used with
DB.query_formatted()
. It marks a parameter as text that shall be
literally included into the SQL. This is useful for passing table names
for instance.
- pg.Literal(sql)¶
A wrapper for holding a literal SQL string
Added in version 5.0.
Module constants¶
Some constants are defined in the module dictionary. They are intended to be used as parameters for methods calls. You should refer to the libpq description in the PostgreSQL user manual for more information about them. These constants are:
- pg.version¶
- pg.__version__¶
constants that give the current version
- pg.INV_READ¶
- pg.INV_WRITE¶
large objects access modes, used by
Connection.locreate()
andLargeObject.open()
- pg.POLLING_OK¶
- pg.POLLING_FAILED¶
- pg.POLLING_READING¶
- pg.POLLING_WRITING¶
polling states, returned by
Connection.poll()
- pg.SEEK_SET¶
- pg.SEEK_CUR¶
- pg.SEEK_END¶
positional flags, used by
LargeObject.seek()
- pg.TRANS_IDLE¶
- pg.TRANS_ACTIVE¶
- pg.TRANS_INTRANS¶
- pg.TRANS_INERROR¶
- pg.TRANS_UNKNOWN¶
transaction states, used by
Connection.transaction()