This is what Quasar does
extraction, normalization, chunkification, blackhole read
This is the concept used to define the most representative part of a URL chunk in the URL detection system.
Domainish
I'm the king of the world
Titanic
This table logs all IActionAbuseWriteTimeBlock
integrity_actions
This framework allows engineers to create detection rules and respond to abuse in real time.
Integrity Activation Framework (IAF)
This is the normalization that transforms http://bucket.s3-aws-region.amazonaws.com to http://s3.amazonaws.com/bucket
Quirks Normalization
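A minimal sketch of the rewrite described in the clue above, not Quasar's actual implementation. The function name `quirks_normalize` and the host pattern are assumptions made for illustration; only the input and output URL shapes come from the clue.

```python
import re
from urllib.parse import urlsplit, urlunsplit

# Hypothetical pattern for virtual-hosted S3 hosts of the form
# "<bucket>.s3-<region>.amazonaws.com", as in the clue's example.
_S3_VHOST = re.compile(r"^(?P<bucket>.+?)\.s3-[a-z0-9-]+\.amazonaws\.com$")


def quirks_normalize(url: str) -> str:
    """Rewrite a bucket-in-hostname S3 URL to the bucket-in-path form."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    match = _S3_VHOST.match(host)
    if match is None:
        # Not an S3 virtual-hosted URL: leave it untouched.
        return url
    # Move the bucket from the hostname to the front of the path.
    # Any explicit port is dropped in this sketch.
    path = "/" + match.group("bucket") + parts.path
    return urlunsplit(
        (parts.scheme, "s3.amazonaws.com", path, parts.query, parts.fragment)
    )


print(quirks_normalize("http://bucket.s3-aws-region.amazonaws.com"))
```

Collapsing both spellings to one canonical form means the same bucket maps to the same URL chunk regardless of which hostname style was used.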
This is a URL relationship used for propagation in the Blackhole system.
Redirect Chain / One Click Away
With great power comes great responsibility.
Spider-Man (2002)
This table contains VPV events that are sampled for integrity measurement.
integrity_central_population_vpv
This system was originally designed to detect and flag potentially violating content on Meta's platforms using machine learning models.
Objectionable Content Investigation (OCIV)
This is the average QPS (queries per second) handled by Quasar.
50M (10-100M)
This team is responsible for implementing IActionDelete and the corresponding user experience.
The cold never bothered me anyway.
Frozen
This table logs daily Blackhole mutations and Blackhole mutation exceptions
si_blackhole_transition
This is the storage service for most ucc properties
tally counter
This is the technique used to optimize database reads in Quasar's Blackhole service.
Bloom Filter
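A minimal sketch of how a Bloom filter can save database reads, assuming the filter sits in front of a key-value store of chunk statuses. The `BloomFilter` class, the `read_status` helper, the dict standing in for the store, and the sizing parameters are all hypothetical; the clue only names the technique.

```python
import hashlib


class BloomFilter:
    """Fixed-size Bloom filter: no false negatives, some false positives."""

    def __init__(self, num_bits: int = 1024, num_hashes: int = 3) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item: str):
        # Derive num_hashes bit positions by salting SHA-256 with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, item: str) -> bool:
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )


def read_status(chunk: str, bloom: BloomFilter, store: dict):
    """Hit the store only when the filter says the chunk may be present."""
    if not bloom.might_contain(chunk):
        return None  # definitely absent: the database read is skipped
    return store.get(chunk)  # possibly present: confirm with a real read
```

Because most URL chunks are presumably not in the store, the filter answers "definitely absent" for the bulk of lookups, and only the remainder (true hits plus a small fraction of false positives) reach the database.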
This is the concept used to calculate the most representative part of a URL chunk
Business Type
Hakuna Matata
The Lion King
This table logs all outbound clicks every day for threads
outbound_click_events_ig
This is the average time that the Integrity Crawler service takes to finish a scrape
10 minutes (5-30)
This SMC tier hosts Quasar's URL extraction service.
quasar_extraction
These are the two Ents used to store the Blackhole data.
EntSIURLChunkStatus and EntSIURLChunkViolationType
Why so serious?
The Dark Knight
This table contains features extracted from Integrity Crawler scrapes
ws_controller
This is the main interface function that we use to get text from content
genFeatureMapTextArr