silverstripe-postgresql/docs/en/README.md

# PostgreSQL Database Module

## Features

Here is a quick list of what's different in the Postgres module (a full 
description follows afterwards):

* T-Search
* Extended index support
* Array data types
* Transactions
* Table partitioning
* Tablespaces
* Index clustering

If you don't know much about databases, or don't want to use any of the 
advanced features that this module provides, then you don't need to read 
any further.

The use of any of these features, especially the advanced options, implies 
that you have some level of comfort in administrating a Postgres database.

### T-Search

T-Search support is provided via both GiST and GIN. You can cluster and 
search columns with combinations of these methods. It is up to you to 
decide which is most appropriate for your data.

The dev/build process automatically creates a special column on each table, 
and a trigger is automatically set up to update this column whenever the 
targeted columns are changed. T-Search uses this column to return matches 
for search criteria.

Please see tutorial 4 for information how to enable fulltext search and the 
necessary controller hooks.

### Extended index support

Indexes have been extended to include support for more options. These new 
options include:

* The ability to specify index methods (btree/hash/). Btree is probably 
fine nearly all indexes, and it is the default. 'Unique' is also supported.
* Partial indexes. This is especially handy for creating an index while i
gnoring nulls or default data.
* Multiple column indexing. If your WHERE clauses always use the same 
columns, then you can create one index covering all of these at once.
* Fill factor. If your table content is static, then you can reduce the 
physical disk space your index uses. Also, if you use clustering, giving the 
fillfactor a low number may help performance for updates.

Examples:

**Hash index**:

> public static $indexes = array(
>    'Address'=>Array('type'=>'hash', 'name'=>'Address'),
> );

**Where clause**:

> public static $indexes = array(
>    'Address'=>Array('type'=>'unique', 'name'=>'Address', 'where'=>"\"Address\" IS NOT NULL"),
> );

**Fill factor**:

> public static $indexes = array(
>    'Address'=>Array('type'=>'unique', 'name'=>'Address', 'fillfactor'=>'50'),
> );

### Array data types

Nearly all data types in SilverStripe can now be expressed as an array. For 
example, you can specify an int as this:

> $db = array (
>     'Quantity'=>'Int[]'
> )

You would populate this like so:

> $item->Quantity='Array[1,2,3...]';

It also takes object literals if you're more familiar with that or it suits 
your purpose better, like this:

> $item->Quantity='{1,2,3}';

Using arrays as data types means that you can avoid join tables. This is not 
recommended if the SilverStripe ORM would expect a has_one or has_many etc under 
normal circumstances, but it could be useful in the case where you have a very 
large join table. You can also index these arrays with GIN indexes.

Please consult the official Postgres documentation for more information.

### Transactions

Transactions are supported at the database connection level. The relevant 
functions are:

* DB::get_conn()→startTransaction($transaction_mode, $session_characteristics)
* DB::get_conn()→transactionSavepoint($name)
* DB::get_conn()→transactionRollback($savepoint)
* DB::get_conn()→endTransaction();

You can create a savepoint by passing a name to the function, and then rollback 
either all of the uncommited transactions, or if you pass a savepoint name, 
jump back to the point you'd prefer.

$transaction_mode and $session_characteristic take the full range of isolation 
levels supported by Postgres.

Please consult the official Postgres documentation for more information.

### Table Partitioning

**This is an experimental feature.**

If you have a very large table, you can split it into many child tables. The 
advantages of this depend on your particular situation. Generally speaking, 
if your table is very large, queries should be faster.

You can create a partitioned table like this:

> public static $database_extensions = array(
>    'partitions'=>array(
>         'child_table_1'=>'NEW."ID">0 AND NEW."ID"<=100',
>         'child_table_2'=>'NEW."ID">100 AND NEW."ID"<=200'
>    )
> );

'NEW.' is a required part of the configuration string.

Partitioning should be set up right from the beginning. Partitioning a table 
which already has data may have unpredictable results.

Please consult the official Postgres documentation for more information.

### Tablespaces

**This is an experimental feature.**

Tablespaces are good for moving the physical files to a faster device (or slower 
and less used if that's a better option). You can set up a tablespace like this:

> public static $database_extensions = array(
>    'tablespace'=>Array('name'=>'fastspace', 'location'=>'/faster_location'),
> );

The '/faster_location' path must be owned by the postgres user. If you try to 
delete a tablespace via the 'drop tablespace' command, then this directory must
be empty.

Changing the location of the tablespace through the SilverStripe 
$database_extensions array will cause the dev/build process to attempt to delete 
the old location. An error message will be displayed if this location is not 
empty.

Please consult the official Postgres documentation for more information.

### Index Clustering

**This is an experimental feature.**

Index clustering allows you to reorganise the way rows are ordered inside a 
table according to an index specification. This can be a very intensive disk 
operation. You specify an index cluster like this:

> public static $database_extensions = array(
>    'cluster'=>'index_name'
> );

Clustering is only applied on a table on the second instance of a dev/build 
command being run on it (running a cluster command on an empty table is 
pointless).

Clustering needs to be reapplied on a regular basis if you're updating this 
table. You can also decrease the fillfactor on that index as well for 
potential performance gains.

As an alternative, clustering isn't necessary if you rebuild a table with 
an ORDER BY clause, where the ORDER BY column is the same as what you'd be 
clustering it by. The dev/build process does not do table rebuilds, so this 
is something you'd have to do yourself.

Please consult the official Postgres documentation for more information.

**A note about these advanced features**

The advanced features are here as an experimental offering. They have not 
been fully tested and their functionality and purpose may change in the 
future. They are primarily here to offer the ability to handle very large 
datasets.

They are also features which require the user to be very familiar with both 
Postgres and how their data works. If you can't predict how your database 
will be populated, then most of these features will be of little use.

## User contributed information

**Provided by dompie**

If you want to install this on a more secure postgresql server, go to 
PostgreSQLDatabase.php and set "public static $check_database_exists = false;"

Moreover you have to replace in PostgreSQLDatabaseConfigurationHelper.php 
occurrences of

> $connstring = "host=$server port=5432 dbname=postgres {$userPart}{$passwordPart}";

with

> $dbname = $databaseConfig['database']?$databaseConfig['database']: 'postgres';
> $connstring = "host=$server port=5432 dbname=$dbname {$userPart}{$passwordPart}";


Otherwise this extension will try to connect to "postgres" Database to check DB 
connection, no matter what you entered in the "Database Name" field during 
installation.

Make sure you have set the "search_path" correct for your database user.
MINOR: update Postgres doc for docviewer support. FIXES: #6681 2012-06-29 07:56:38 +02:00			`# PostgreSQL Database Module`

			`## Features`

			`Here is a quick list of what's different in the Postgres module (a full`
			`description follows afterwards):`

			`* T-Search`
			`* Extended index support`
			`* Array data types`
			`* Transactions`
			`* Table partitioning`
			`* Tablespaces`
			`* Index clustering`

			`If you don't know much about databases, or don't want to use any of the`
			`advanced features that this module provides, then you don't need to read`
			`any further.`

			`The use of any of these features, especially the advanced options, implies`
			`that you have some level of comfort in administrating a Postgres database.`

			`### T-Search`

			`T-Search support is provided via both GiST and GIN. You can cluster and`
			`search columns with combinations of these methods. It is up to you to`
			`decide which is most appropriate for your data.`

			`The dev/build process automatically creates a special column on each table,`
			`and a trigger is automatically set up to update this column whenever the`
			`targeted columns are changed. T-Search uses this column to return matches`
			`for search criteria.`

			`Please see tutorial 4 for information how to enable fulltext search and the`
			`necessary controller hooks.`

			`### Extended index support`

			`Indexes have been extended to include support for more options. These new`
			`options include:`

			`* The ability to specify index methods (btree/hash/). Btree is probably`
			`fine nearly all indexes, and it is the default. 'Unique' is also supported.`
			`* Partial indexes. This is especially handy for creating an index while i`
			`gnoring nulls or default data.`
			`* Multiple column indexing. If your WHERE clauses always use the same`
			`columns, then you can create one index covering all of these at once.`
			`* Fill factor. If your table content is static, then you can reduce the`
			`physical disk space your index uses. Also, if you use clustering, giving the`
			`fillfactor a low number may help performance for updates.`

			`Examples:`

			`Hash index:`

			`> public static $indexes = array(`
			`> 'Address'=>Array('type'=>'hash', 'name'=>'Address'),`
			`> );`

			`Where clause:`

			`> public static $indexes = array(`
			`> 'Address'=>Array('type'=>'unique', 'name'=>'Address', 'where'=>"\"Address\" IS NOT NULL"),`
			`> );`

			`Fill factor:`

			`> public static $indexes = array(`
			`> 'Address'=>Array('type'=>'unique', 'name'=>'Address', 'fillfactor'=>'50'),`
			`> );`

			`### Array data types`

			`Nearly all data types in SilverStripe can now be expressed as an array. For`
			`example, you can specify an int as this:`

			`> $db = array (`
			`> 'Quantity'=>'Int[]'`
			`> )`

			`You would populate this like so:`

			`> $item->Quantity='Array[1,2,3...]';`

			`It also takes object literals if you're more familiar with that or it suits`
			`your purpose better, like this:`

			`> $item->Quantity='{1,2,3}';`

			`Using arrays as data types means that you can avoid join tables. This is not`
			`recommended if the SilverStripe ORM would expect a has_one or has_many etc under`
			`normal circumstances, but it could be useful in the case where you have a very`
			`large join table. You can also index these arrays with GIN indexes.`

			`Please consult the official Postgres documentation for more information.`

			`### Transactions`

			`Transactions are supported at the database connection level. The relevant`
			`functions are:`

API Upgraded module to use new database ORM 2013-04-03 06:27:11 +02:00			`* DB::get_conn()→startTransaction($transaction_mode, $session_characteristics)`
			`* DB::get_conn()→transactionSavepoint($name)`
			`* DB::get_conn()→transactionRollback($savepoint)`
			`* DB::get_conn()→endTransaction();`
MINOR: update Postgres doc for docviewer support. FIXES: #6681 2012-06-29 07:56:38 +02:00
			`You can create a savepoint by passing a name to the function, and then rollback`
			`either all of the uncommited transactions, or if you pass a savepoint name,`
			`jump back to the point you'd prefer.`

			`$transaction_mode and $session_characteristic take the full range of isolation`
			`levels supported by Postgres.`

			`Please consult the official Postgres documentation for more information.`

			`### Table Partitioning`

			`This is an experimental feature.`

			`If you have a very large table, you can split it into many child tables. The`
			`advantages of this depend on your particular situation. Generally speaking,`
			`if your table is very large, queries should be faster.`

			`You can create a partitioned table like this:`

			`> public static $database_extensions = array(`
			`> 'partitions'=>array(`
			`> 'child_table_1'=>'NEW."ID">0 AND NEW."ID"<=100',`
			`> 'child_table_2'=>'NEW."ID">100 AND NEW."ID"<=200'`
			`> )`
			`> );`

			`'NEW.' is a required part of the configuration string.`

			`Partitioning should be set up right from the beginning. Partitioning a table`
			`which already has data may have unpredictable results.`

			`Please consult the official Postgres documentation for more information.`

			`### Tablespaces`

			`This is an experimental feature.`

			`Tablespaces are good for moving the physical files to a faster device (or slower`
			`and less used if that's a better option). You can set up a tablespace like this:`

			`> public static $database_extensions = array(`
			`> 'tablespace'=>Array('name'=>'fastspace', 'location'=>'/faster_location'),`
			`> );`

			`The '/faster_location' path must be owned by the postgres user. If you try to`
			`delete a tablespace via the 'drop tablespace' command, then this directory must`
			`be empty.`

			`Changing the location of the tablespace through the SilverStripe`
			`$database_extensions array will cause the dev/build process to attempt to delete`
			`the old location. An error message will be displayed if this location is not`
			`empty.`

			`Please consult the official Postgres documentation for more information.`

			`### Index Clustering`

			`This is an experimental feature.`

			`Index clustering allows you to reorganise the way rows are ordered inside a`
			`table according to an index specification. This can be a very intensive disk`
			`operation. You specify an index cluster like this:`

			`> public static $database_extensions = array(`
			`> 'cluster'=>'index_name'`
			`> );`

			`Clustering is only applied on a table on the second instance of a dev/build`
			`command being run on it (running a cluster command on an empty table is`
			`pointless).`

			`Clustering needs to be reapplied on a regular basis if you're updating this`
			`table. You can also decrease the fillfactor on that index as well for`
			`potential performance gains.`

			`As an alternative, clustering isn't necessary if you rebuild a table with`
			`an ORDER BY clause, where the ORDER BY column is the same as what you'd be`
			`clustering it by. The dev/build process does not do table rebuilds, so this`
			`is something you'd have to do yourself.`

			`Please consult the official Postgres documentation for more information.`

			`A note about these advanced features`

			`The advanced features are here as an experimental offering. They have not`
			`been fully tested and their functionality and purpose may change in the`
			`future. They are primarily here to offer the ability to handle very large`
			`datasets.`

			`They are also features which require the user to be very familiar with both`
			`Postgres and how their data works. If you can't predict how your database`
			`will be populated, then most of these features will be of little use.`

			`## User contributed information`

			`Provided by dompie`

			`If you want to install this on a more secure postgresql server, go to`
			`PostgreSQLDatabase.php and set "public static $check_database_exists = false;"`

			`Moreover you have to replace in PostgreSQLDatabaseConfigurationHelper.php`
			`occurrences of`

			`> $connstring = "host=$server port=5432 dbname=postgres {$userPart}{$passwordPart}";`

			`with`

			`> $dbname = $databaseConfig['database']?$databaseConfig['database']: 'postgres';`
			`> $connstring = "host=$server port=5432 dbname=$dbname {$userPart}{$passwordPart}";`


			`Otherwise this extension will try to connect to "postgres" Database to check DB`
			`connection, no matter what you entered in the "Database Name" field during`
			`installation.`

API Increased support to 9.2 Anything older than 9.3 is unsupported by Postgres: https://www.postgresql.org/support/versioning/. I don't think we should claim support in our module for unsupported versions, particularly if we don't have CI on them. Since we're only running CI on 9.2 at the moment, that's the safe claim for our module support, even though it's already unsupported by Postgres. See https://docs.travis-ci.com/user/database-setup/#using-a-different-postgresql-version I'll raise a separate ticket about testing and supporting newer versions, it's out of scope for this PR. 2019-09-03 05:41:23 +02:00			`Make sure you have set the "search_path" correct for your database user.`