I figured I've done threads for almost everything else I'm tagging, so I might as well make one for this.
I'd like to start by saying/asking that if I've missed any that I could tag, please let me know. I'd greatly appreciate it.
I'm currently using text recognition to tag:
text
english_text
url
pet_praise
apology
grawlixes
dirty_talk
profanity
?!
greeting
holiday_message
For english_text I've got a small dictionary of common words that are over four characters long. Any less and Chinese/Japanese characters cause mis-tagging.
For apology I'm tagging posts that include "apologies", or "sorry" without "but". Does that seem ok, or should I change it? Or are there others I should add?
For dirty_talk I looked over the tag and picked a few phrases, such as "fill me with your" and including "cum/seed". I've got one or two, but any you can think of would be helpful.
For greeting I've mostly stuck with "hello", I originally had "hey" but it could be more of a shout of surprise. "Hi" seemed good until it tagged any skyscraper with it, as the concrete between windows make "H" and "I" shapes, like with post #1875087. "Sup", I thought was a bit vague, " 'sup" could be better. I'm also tagging "gday" and "g'day".
And for holiday_message I've tagged most of them, but should I tag mother's/father's day? They're specific days, but not really holidays. Also, mother's_day implies holidays, but I'm still unsure if "happy mother's day" is really a holiday_message.
Then these are some I'm not so sure how to tag:
question
threat
wall_of_text
For question, could I tag anything with a question mark? I felt like that was a bit obvious, but wasn't sure whether to say it also had to include, "what/why/when/who/how/are you", or if the question mark was enough.
For threat, I can't think of how to tag it. I thought of different phrases, like dirty_talk, but often they could be used in other contexts. Something like "do x or I'll y", seems ok, but I'm still uncertain on it.
And lastly, wall_of_text, at first I thought it'd be easy, count the words in the current wall_of_text posts, then apply that to others. But then it's surprising how many comics would get mis-tagged, that definitely aren't walls of text. So I feel there isn't an easy way to tag this, as it's more on how much the text fills the image, rather than just the character count.
Again, any ideas for alterations, improvements or more things I could tag would be greatly appreciated!
Edit:
I included "(Tag suggestions/improvements appreciated!)" in the title as I didn't want people thinking I was asking how to go about automating the tagging, rather than looking for suggestions, though I'm not keen on the wording.
Updated