<aside> đź’ˇ Whichever social media platform you gather data from, Phoenix will put it into a standard table. This table will be automatically connected to a dashboard that you can access in the visualise tab (more on that in this section).
</aside>
Contents of this page:
The standard table has one row for each post-comment pair. What does that mean?
For example, suppose you collect three posts only (with no comments), then your table will have the following rows:
Post 1 |
---|
Post 2 |
Post 3 |
Suppose you then collect the comments for posts 1 and 3. Post 1 has 2 comments and Post 3 has 3 comments. Now your table will have these rows:
Post 1 | Comment 1 |
---|---|
Post 1 | Comment 2 |
Post 2 | |
Post 3 | Comment 1 |
Post 3 | Comment 2 |
Post 3 | Comment 3 |
Every row in your table has the same columns, which we explain below.
We have normalised some columns across all social media platforms. This means that we have assumed that some things specific to a platform mean more or less the same thing. This makes it easier for you to compare data from different social media platforms. How we have normalised specific columns is explained in detail in the table below, but there are three normalisations that we use for all columns and that will help you understand the table:
We have marked some columns with “_pi” at the end to indicate that they contain personal information. We follow the definition of personal information dictated by GDPR. In order for Phoenix to be GDPR compliant, any column marked “_pi” will be deleted automatically from Phoenix after a specific amount of time (that you defined when you created your project).
<aside> đź’ˇ
When you are making graphs or tables in your dashboard (explained in this section), you may find it useful to have the table below open so you can check you are using the right data for your graphs and tables.
</aside>
Column name | Description |
---|---|
platform | The social media platform that this post or comment comes from. |
post_author_category | The category of the author as assigned by the platform. Not all platforms assign categories, but where this is available, we include it in this column. For example: TikTok commercial users have a category assigned by TikTok. |
post_author_class | If you create an author classification using Phoenix classify and you apply it to post authors, then it will appear in this column. |
post_author_description_pi | The description of the author as provided in the platform. Not all platforms allow us to collect this information, but if we can collect it, then we include it in this column. Note: this is personal information. |
post_author_followers_count | The number of other accounts following the author of the post. This is slightly different for each platform: TikTok account handle followers, Facebook page subscribers, Facebook group members, X handle followers, YouTube account subscribers, Instagram followers. |
post_author_id | A unique id number that Phoenix assigns to the post author. |
post_author_location | If the platform provides a location for the author, we include it in this column. |
post_author_name_pi | The name of the post author. Note: this is personal information. |
post_author_link_pi | The link to the author’s profile. Note: this is personal information. |
post_class | If you create a text feature or complex model classification using Phoenix classify and you apply it to posts, then it will appear in this column. |
post_comment_count | The number of comments on a post (at the time you ran the gather). |
post_date | The date when the post was made. |
post_gather_id | A unique id number that Phoenix assigns to the “gather” you used to get this post. It indicates that if you delete the gather with this id it will delete this post. |
post_id | A unique id number that Phoenix assigns to each post. It is not personal information and cannot be traced back to the real platform id for that post |
post_like_count | The number of likes on a post (at the time you ran the gather). This is slightly different for each platform: Facebook likes, TikTok diggCount, X likes / hearts, YouTube video likes, Instagram likes / hearts. |
post_link_pi | The link to the post. Note: this is personal information. |
post_share_count | The number of times a post is shared (at the time you run the gather). This is slightly different for each platform: Facebook shares, TikTok reposts, X retweets, YouTube video shares. (This is not relevant for Instagram.) You will not get this metric if you collect Facebook posts using a keyword search. |
post_text_pi | The text of the post. Note: this is personal information. |
comment_author_class | If you create an author classification using Phoenix classify and you apply it to comment authors, then it will appear in this column. |
comment_author_id | A unique id number that Phoenix assigns to the comment author. It is not personal information and cannot be traced back to the real platform id for that author. |
comment_author_name_pi | The name of the comment author. Note: this is personal information. |
comment_class | If you create a text feature or complex model classification using Phoenix classify and you apply it to comments, then it will appear in this column. |
comment_date | The date when the comment was made. |
comment_gather_id | A unique id number that Phoenix assigns to the “gather” you used to get this comment. It indicates that if you delete the gather with this id it will delete this comment. |
comment_id | A unique id number that Phoenix assigns to each comment. It is not personal information and cannot be traced back to the real platform id for the comment. |
comment_like_count | The number of likes on a comment (at the time you ran the gather). This is only available on some platforms. |
comment_link_pi | The link to the comment. This is only available on some platforms. Note: this is personal information. |
comment_parent_post_id | The id number of the original post under which this comment was written. |
comment_replied_to_id | If the comment is under another comment (as a reply), then the id of the comment it is replying to. If the comment is under a post, then the id of the post (that is, it will be the same as comment_parent_post_id). |
comment_text_pi | The text of the comment. Note: this is personal information. |
facebook_video_views | This is only for Facebook posts: where the post is a video, the number of times the video was viewed (at the time you ran the gather). |
tiktok_post_plays | This is only for TikTok posts: the number of times the video was played (at the time you ran the gather). |
x_tweet_quotes | This is only for X: the number of times the tweet was quoted (at the time you ran the gather). |
phoenix_processed_at | The Timestamp when this row was last updated. Currently, this will be the same for all rows. You won't be needing to use this row for analysis. |
phoenix_job_run_id | The id of phoenix pipeline job. This is used for developers to make it easier to find problems and can be ignored by an analyst. You won't be needing to use this row for analysis. |