0
0
mirror of https://github.com/PostHog/posthog.git synced 2024-12-01 12:21:02 +01:00
posthog/ee/clickhouse
Michael Matloka 2f5266e2cf
Funnel time to convert query (#4947)
* Create funnel_time_to_convert.py

* Create test_funnel_time_to_convert.py

* Remove unused things

* Rework time to convert query to use our own SQL

* Add comments to query

* Fix typo in comment

Co-authored-by: Neil Kakkar <neilkakkar@gmail.com>

* Use feedback

* Only override `ClickhouseFunnelTimeToConvert._format_results`

* Switch test users B and C around

* Fix typo

Co-authored-by: Neil Kakkar <neilkakkar@gmail.com>
Co-authored-by: eric <eeoneric@gmail.com>
2021-07-01 13:41:11 -04:00
..
migrations add migration that migrates the table again without sharding (#4604) 2021-06-07 11:39:30 +03:00
models Handle cohort being filtered on not existing (#4961) 2021-07-01 13:30:18 -04:00
queries Funnel time to convert query (#4947) 2021-07-01 13:41:11 -04:00
sql Speed up /api/person/properties under clickhouse (#4940) 2021-07-01 12:31:23 +03:00
test Remove process_event_ee (#4063) 2021-04-23 14:10:28 +02:00
views Make funnel persons api call return values structure consistent with others (#4957) 2021-07-01 10:04:40 -04:00
__init__.py Fix Master EE code (#1701) 2020-09-24 06:14:17 -04:00
client.py Only send error class name to statsd (#4902) 2021-06-29 23:50:29 +02:00
demo.py Populate a lone session recording for demo (#3128) 2021-01-29 14:28:55 +01:00
generate_local.py Generate local tests cases (#4874) 2021-06-25 12:38:04 -04:00
middleware.py Rename is_ee_enabled to is_clickhouse_enabled (#4216) 2021-05-13 16:56:54 +02:00
README.md Add --plan and --fake flags to migrate_clickhouse (#4160) 2021-04-29 15:40:12 +03:00
system_status.py Add running & slow queries to instance status dashboards (#4420) 2021-05-21 09:26:01 +03:00
util.py Create collapsing table for cohortpeople (#4477) 2021-05-27 17:44:43 +03:00

Clickhouse Support (Enterprise Feature)

To accomodate high volume deployments, Posthog can use Clickhouse instead of Postgres. Clickhouse isn't used by default because Postgres is easier to deploy and maintain on smaller instances and on platforms such as Heroku.

Clickhouse Support works by swapping in separate queries and classes in the system for models that are most impacted by high volume usage (ex: events and actions).

Migrations and Models

The django_clickhouse orm is used to manage migrations and models. The ORM is used to mimic the django model and migration structure in the main folder.

Certain migrations (e.g. changing table engines) can be quite expensive and tricky, especially for deployments outside of cloud. To skip these steps during deployment setup, check for the CLICKHOUSE_INITIAL_MIGRATIONS environment variable.

If you need help in making them happen, ask for help from team deployments.

Queries

Queries parallel the queries folder in the main folder however, clickhouse queries are written in SQL and do not utilize the ORM.

Tests

The tests are inherited from the main folder. The Clickhouse query classes are based off BaseQuery so their run function should work just as the Django ORM backed query classes. These classes are called with the paramterized tests declared in the main folder which allows the same suite of tests to be run with different implementations.

Views

Views contain Viewset classes that are not backed by models. Instead the views query Clickhouse tables using SQL. These views match the interface provide by the views in the main folder.