0
0
mirror of https://github.com/PostHog/posthog.git synced 2024-11-24 18:07:17 +01:00
posthog/ee/idl/element.avro
Marius Andra dd7e38c5b5
Clickhouse Elements Dedup (based on master) (#1698)
* Use ReplacingMergeTree for elements, remove element_groups and use elements_hash as a virtual "pk"

* remove unused ELEMENT_GROUP_TABLE_SQL

* merge fixes

* use redis cache to avoid writing duplicate elements to clickhouse

* move fakeredis to requirements.txt

* add team_id to cache key

* remove elements_group kafka table references

* add elements_hash to clickhouse element serializer

* fix cache key

* rename few keys

* add test runner to ease pycharm dev

* fix a some mypy errors

* remove typo

Co-authored-by: Eric <eeoneric@gmail.com>
2020-09-24 06:47:28 -04:00

54 lines
821 B
Plaintext

{
"namespace": "com.posthog.ee",
"name": "element",
"type": "record",
"fields" : [
{
"name" : "text",
"type" : "string"
},
{
"name" : "tag_name",
"type" : "string"
},
{
"name" : "href",
"type" : "string"
},
{
"name" : "attr_id",
"type" : "string"
},
{
"name" : "attr_class",
"type" : "string"
},
{
"name" : "nth_child",
"type" : "string"
},
{
"name" : "nth_of_type",
"type" : "string"
},
{
"name" : "attributes",
"type" : "map",
"values" : "string"
},
{
"name" : "order",
"type" : "long"
},
{
"name" : "team_id",
"type" : "long"
},
{
"name" : "elements_hash",
"type" : "string"
}
]
}