Snowflake Data Fields

After you've imported your HUMAN data into Snowflake, you can view the data fields listed below, which correspond to each Snowflake view.

The following Snowflake views are available:

  • mediaguard_prebid and mediaguard_prebid_sampled, which include pre-bid MediaGuard data.
  • mediaguard_postbid, which includes post-bid FraudSensor data.
  • mediaguard_prebid_postbid_combined_global, which includes pre-bid MediaGuard and post-bid FraudSensor joined data.

📘

Note

The mediaguard_prebid view pulls data from all MediaGuard requests during the selected time period, while the mediaguard_prebid_sampled view pulls data from a smaller subset of requests to provide more detailed information about a limited data pool. Certain pre-bid fields are only available for sampled data.


MediaGuard (pre-bid) fields

  • App ID (app_id): (sampled data only) App bundle where the ad was intended to be delivered, as provided by the publisher (for in-app mobile environments only).
  • App Last Checked (insights_removed_app_last_checked): (sampled data only) Date and time the app was last checked to be present at the relevant app store.
  • App Last Found (insights_removed_app_last_found): (sampled data only) Date and time the app was last seen at the relevant app store.
  • Bots (ivt_bots): Number of bid requests identified as invalid traffic (IVT).
  • Compact SChain (compact_schain): (sampled data only) SupplyChain Object provided in your request to MediaGuard, with optional fields removed.
  • Customer ID (customer_id): Unique identifier associated with your HUMAN account.
  • Device Type (device_type): Device type as provided in your request to MediaGuard.
  • Domain (domain): (sampled data only) Domain where the ad is intended to be delivered.
  • GIVT (givt_bots): Number of bid requests categorized as General IVT (GIVT).
  • High Incentivized Traffic App (insights_high_incentivized_app): (sampled data only) App has a high level of traffic with incentivized IPs.
  • High Incentivized Traffic Site (insights_high_incentivized_site): (sampled data only) Site has a high level of traffic with incentivized IPs or UTMs.
  • High Paid or Social Sourcing Site (insights_high_paid_or_social_src_site): (sampled data only) Site has a high level of paid or social traffic sourcing.
  • High Paid Sourcing Site (insights_high_paid_src_site): (sampled data only) Site has a high level of paid traffic sourcing.
  • High Social Sourcing Site (insights_high_social_src_site): (sampled data only) Site has a high level of social traffic sourcing.
  • IP Address: IP address where the ad impression (and HUMAN's detection tag) is intended to be delivered.
  • IVT - Automated Browsing (ivt_bot_category_ab): Invalid requests driven by automated programs or scripts that did not declare themselves as crawlers.
  • IVT - Data Center (ivt_bot_category_dc): Invalid requests originating from data center IPs linked to invalid activity.
  • IVT - False Representation (ivt_bot_category_fr): Invalid requests where requested content differed from actual content supplied.
  • IVT - Irregular Pattern (ivt_bot_category_fr): Invalid requests associated with irregular, nonhuman behavior.
  • IVT - Known Crawler (ivt_bot_category_kc): Invalid requests driven by non-malicious automated scripts declaring themselves as nonhuman.
  • IVT - Misleading User Interface (ivt_bot_category_mui): Requests resulting from interfaces falsely including ads in unintended locations.
  • IVT - Undisclosed Classification (ivt_bot_category_uc): Invalid requests that could not be classified into other categories.
  • IVT Categories (ivt_bot_category, ivt_bot_category_str): IVT categories associated with the request.
  • IVT Sub Categories (ivt_bot_subcategory, ivt_bot_subcategory_str): IVT subcategories associated with the request.
  • Media Type (media_type): Media type as provided in your request to MediaGuard.
  • MFA (insights_mfa): (sampled data only) Domain or site show significant evidence of MFA. Set automatically or by manual review.
  • MFA Domain (insights_mfa_domain): (sampled data only) Domain shows significant evidence of MFA. Set automatically or by manual review.
  • MFA Domain Flag Overridden (insights_mfa_flag_overridden_domain): (sampled data only) Domain flagged as MFA through automated indicators but overridden by manual review.
  • MFA Domain Indicators (insights_mfa_domain_indicators): (sampled data only) Indicators of MFA identified on this domain. “Manual Review” indicates the domain was manually reviewed by HUMAN and significant evidence of MFA was found.
  • MFA Flag Overridden (insights_mfa_flag_overridden): (sampled data only) Domain or site flagged as MFA through automated indicators but overridden by manual review.
  • MFA Site (insights_mfa_site): (sampled data only) Site shows significant evidence of MFA. Set automatically or by manual review.
  • MFA Site Flag Overridden (insights_mfa_flag_overridden_site): (sampled data only) Site flagged as MFA through automated indicators but overridden by manual review.
  • MFA Site Indicators (insights_mfa_site_indicators): (sampled data only) Indicators of MFA identified on this site. “Manual Review” indicates the site was manually reviewed by HUMAN and significant evidence of MFA was found.
  • Publisher ID (publisher_id): Publisher ID as provided in your request to MediaGuard.
  • Removed App (insights_removed_app): (sampled data only) App removed from the relevant app store.
  • SChain (schain): (sampled data only) SupplyChain Object provided in your request to MediaGuard.
  • SIVT (sivt_bots): Number of bid requests identified as Sophisticated IVT (SIVT).
  • Supplier ID (supplier_id): Supplier ID as provided in your request to MediaGuard.
  • Timestamp (timestamp): Date and time the request was recorded by HUMAN.
  • Total Requests (requests): Total number of bid requests.
  • User Agent: (user_agent) User agent of the user who drove the request.
  • Valid Requests (valid_requests): Number of valid bid requests.

FraudSensor (post-bid) fields

  • App Mismatch (app_mismatch): Indicates a mismatch between the app ID reported by the device and the actual app ID detected (for in-app mobile environments only).
  • Bot (bot): Indicates if the impression was identified as invalid traffic (IVT).
  • Context (st_ad_context): Context of the ad delivery (e.g., web, in-app).
  • Country (country): Country where the impression originated.
  • Customer ID (customer_id): Unique identifier associated with your HUMAN account.
  • Customer Tag (ct): Parameters passed to HUMAN’s detection tag.
  • Ad ID/Placement ID (pi): Unique identifier for the digital advertisement.
  • Advertiser (ai): Advertiser or brand running the ad campaign.
  • Buy Type (bt): Indicates if the ad was purchased programmatically or directly.
  • Campaign ID (ac): Unique identifier for tracking the campaign.
  • Custom Fields (c1 - c10): Custom parameters passed to HUMAN’s tag.
  • Dimensions (dm): Ad creative dimensions (width x height, in pixels).
  • Domain Intended (di): Domain where the ad was intended to be delivered.
  • Prevention ID (pv): Corresponding MediaGuard prevention ID.
  • Publisher ID (pp): Publisher of the traffic source.
  • Reporting Fields (r1 - r10): Custom data for reporting via HUMAN’s Reporting API.
  • Supplier ID (sr): Supplier domain of the traffic source.
  • Tag ID (dt): Identifier of the detection tag that detected the impression.
  • Transaction ID (ti): Unique identifier for each impression.
  • Detected App ID (detected_app_id): Detected app bundle for the ad delivery (in-app mobile environments only).
  • Device Type (st_device): Software/device type (e.g., Android, Roku).
  • Domain Mismatch (mismatch): Indicates a mismatch between the Intended Domain and True Domain fields.
  • Href Domain (href_domain): Domain of the page or iframe where the impression occurred.
  • Href URL Parameters (href_fields): URL query parameters attached to the page URL.
  • Page (page): Webpage where the impression occurred.
  • Referring Domain (ref_domain): Domain referring the user to the impression site.
  • True Domain (true_domain): Domain where HUMAN's detection tag was attached.
  • Environment (environment): Environment of ad delivery (mobile web, desktop).
  • Event Type (event_type_str): Detected event type (impression or click).
  • GIVT (givt): Indicates if the impression was identified as General IVT (GIVT).
  • Href URL (href_url): Full URL of the page or iframe where the impression occurred.
  • IVT Categories (ivt_bot_category_str): IVT categories associated with the impression.
  • IVT Sub Categories (ivt_bot_subcategory, ivt_bot_subcategory_str): IVT subcategories associated with the request.
  • Media Type (st_media_type): Media type of the inventory (e.g., banner, video).
  • OS Name (ua_os_name): Operating system of the delivery device.
  • Platform Type (platform): Platform type (e.g., desktop, mobile).
  • Session ID (session_id): Unique identifier for each event.
  • SIVT (sivt): Indicates if the impression was identified as Sophisticated IVT (SIVT).
  • SSAI (is_ssai): Indicates if the ad was delivered via server-side ad insertion.
  • Surface (st_surface): Device category (e.g., mobile, desktop).
  • Tag Type (tag_type): Type of FraudSensor tag that detected the event.
  • Timestamp (timestamp): Date and time the impression was recorded.
  • UTM Campaign (utm_campaign): UTM campaign parameter from the impression URL.
  • UTM Content (utm_content): UTM content parameter from the impression URL.
  • UTM Medium (utm_medium): UTM medium parameter from the impression URL.
  • UTM Source (utm_source): UTM source parameter from the impression URL.
  • UTM Term (utm_term): UTM term parameter from the impression URL.

FraudSensor + MediaGuard fields

  • Customer ID (customer_id): (sampled data only) Unique identifier associated with your HUMAN account.
  • FS Domain (fs_domain): (sampled data only) Domain of the page where HUMAN’s detection tag was attached.
  • FS Root Domain (fs_domain_root): (sampled data only) Root domain of the page where HUMAN’s detection tag was attached. If it was attached inside an iframe, this field may not be populated.
  • FS+MG Domain Mismatch (insights_domain_mismatch): (sampled data only) Mismatch between the domain in the URL parameter you passed to MediaGuard and the FS Domain detected by HUMAN.
  • FS+MG IP Mismatch (insights_ip_mismatch): (sampled data only) Mismatch between the IP address in the ip parameter you passed to MediaGuard and the IP address detected by HUMAN.
  • FS+MG Root Domain Mismatch (insights_domain_mismatch_root): (sampled data only) Mismatch between the domain in the URL parameter you passed to MediaGuard and the FS Domain detected by HUMAN, where the root domains (eTLD+1) differ.
  • FS+MG Subdomain Mismatch (insights_domain_mismatch_sub): (sampled data only) Mismatch between the domain in the URL parameter you passed to MediaGuard and the FS Domain detected by HUMAN, where the root domains (eTLD+1) match but the subdomains differ.
  • MG Domain (mg_domain): (sampled data only) Full domain where the ad was intended to be delivered.
  • MG IP Address (mg_ip): (sampled data only) IP address as provided in your request to MediaGuard.
  • MG Publisher ID (mg_publisher_id): (sampled data only) Publisher ID as provided in your request to MediaGuard.
  • MG Root Domain (mg_domain_root): (sampled data only) Root domain where the ad was intended to be delivered.
  • MG Seller Domain (mg_seller_domain): (sampled data only) Seller domain from sellers.json.
  • MG Seller Name (mg_seller_type): (sampled data only) Seller type from sellers.json.
  • MG Seller Type (mg_seller_name): (sampled data only) Seller name from sellers.json.
  • MG Supplier ID (mg_supplier_id): (sampled data only) Supplier ID as provided in your request to MediaGuard.
  • MG Timestamp (mg_timestamp): (sampled data only) Date and time HUMAN recorded the request.
  • Timestamp (fs_timestamp): (sampled data only) Date and time HUMAN recorded the impression.