{"id":7782,"date":"2024-01-10T12:24:36","date_gmt":"2024-01-10T17:24:36","guid":{"rendered":"https:\/\/www.coginov.com\/?p=7782"},"modified":"2024-01-10T19:01:11","modified_gmt":"2024-01-11T00:01:11","slug":"qoreaudit-the-mechanics-of-the-semantic-contextualization","status":"publish","type":"post","link":"https:\/\/www.coginov.com\/en\/qoreaudit-the-mechanics-of-the-semantic-contextualization\/","title":{"rendered":"QoreAudit: The mechanics of the semantic contextualization."},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"7782\" class=\"elementor elementor-7782 elementor-7781\" data-elementor-post-type=\"post\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-a967114 elementor-section-boxed elementor-section-height-default elementor-section-height-default marketum_parallax_no\" data-id=\"a967114\" data-element_type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-c8e1c5e\" data-id=\"c8e1c5e\" data-element_type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-2751f28 elementor-widget elementor-widget-marketum_heading\" data-id=\"2751f28\" data-element_type=\"widget\" data-widget_type=\"marketum_heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\n        <div class=\"marketum_heading_widget\">\n                                <h2 class=\"marketum_heading\">\n                        QoreAudit: The mechanics of the semantic contextualization.                     <\/h2>\n                            <\/div>\n        \t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-e64f681 elementor-widget-divider--view-line elementor-widget elementor-widget-divider\" data-id=\"e64f681\" data-element_type=\"widget\" data-widget_type=\"divider.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-divider\">\n\t\t\t<span class=\"elementor-divider-separator\">\n\t\t\t\t\t\t<\/span>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-7c21a37 elementor-widget elementor-widget-text-editor\" data-id=\"7c21a37\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p><strong>By Alkis Papadopoullos, CEO and CSO at Coginov<\/strong><\/p><p>As privacy laws are enacted throughout various countries and regions, the ability to uncover relevant tags indicative of personal or sensitive data (often referred to as PII \u2013 personal identifiable information \u2013 and SII \u2013 sensitive identifiable information) is becoming more and more important. Achieving this requires tools and algorithms to be able to put text-based keywords and expressions in context in order to minimize false positive or false negative hits when discovering PII or SII data.\u00a0 Typical types of said data can be person or organization names, phone numbers, addresses, financial information or records, citizen identification documents, etc.<\/p><p>To address this need, we present some ideas around the mechanics of semantic contextualization. Platforms that attempt to discover and extract such information have a set of very tangible expectations such as obtaining a list of possible PII or SII candidates for review and subsequent calls to action regarding disposal, storage, or protection of this data. However, this proves very difficult to accomplish if analysis is based solely on brute force extraction without any context. For example, is what seems to be a person\u2019s name, actually part of street name and hence part of an address? Is a sixteen-digit numeral actually a credit card number? Etc.<\/p><p>Machine learning based semantic analysis can help to achieve these goals, primarily because it involves associating to each potential sensitive data piece, the meaning of that data; it is thus equivalent to extracting and storing concepts rather than keywords. By identifying concepts along with named entities (PII and SII data) it is possible to achieve three important goals that are the cornerstones of reliably identifying personal information:<\/p><ul><li>Avoid breaking up conceptual units consisting of multiple words (\u201cmy credit card number is\u201d).<\/li><li>Use different semantic categories to facilitate discovery of unexpected or related themes or concepts.<\/li><li>Establish correlations between concepts to provide context for analysis.<br \/><br \/><\/li><\/ul><p>Coginov\u2019s QoreAudit product helps to precisely achieve such goals. Using a natural language processing approach that combines semantic analysis with proprietary machine learning algorithms we strive to help users reliably identify meaning in content and relate it to potential PII and SII data. This enhances customers\u2019 ability to reliably mine data for actionable information and does so all the while reducing the time that must be spent analyzing data to draw reliable conclusions. This in turn means that we can identify whether a concept evoked in a comment is clearly referring to a snippet of personal or sensitive data.<\/p><p>Another very significant advantage of semantic contextualization is the ability to compute a document\u2019s \u201csemantic profile\u201d based on the type of PII and SII data extracted. IN so doing, it is possible then possible to assess the level \u201csensitivity\u201d of a given document or set of documents and much more accurately determine whether chances of identity theft, intellectual property theft, sensitive financial data acquisition, etc., are higher. By mapping the most relevant concepts to potential PII and SII data, we can determine several very interesting things:<\/p><ul><li>What is the real risk factor associated with a given document or set of documents.<\/li><\/ul><ul><li>How much conceptual and PII or SII overlap there is across multiple documents<\/li><li>Reduce duplication of documents with PII or SII data, as duplicates multiply the chance of more effectively stealing private data).<\/li><li>Where are the riskiest documents concentrated (i.e. from which data store or source).<\/li><li>Take measures to address loss of private data through tangible risk-based assessment and call to action!<br \/><br \/><\/li><\/ul><p>In summary, through semantic contextualization Coginov\u2019s QoreAudit product allows customers to gain actionable insights more rapidly from most or all of their data repositories, understand the specifics about why certain documents or sets of documents are riskier than others, and take tangible action to protect all PII and SII data they hold. \u00a0Please feel to contact us at sales@coginov.com if you are interested in further information or a demo of our product.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-4c40d3a elementor-widget elementor-widget-marketum_blockquote\" data-id=\"4c40d3a\" data-element_type=\"widget\" data-widget_type=\"marketum_blockquote.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\n        <div class=\"marketum_blockquote_widget\">\n            <div class=\"marketum_blockquote marketum_view_type_3\">\n                                        <div class=\"marketum_blockquote_marker_container\">\n                                                <div class=\"marketum_blockquote_marker marketum_blockquote_marker_1\"><\/div>\n                        <div class=\"marketum_blockquote_marker marketum_blockquote_marker_2\"><\/div>\n                                                <\/div>\n                        COGINOV <p>\n\nWe create innovative solutions  <p>\n\nCOGINOV is recognized as a world leader in semantic technologies and information management. We are a Canadian software company offering our customers innovative solutions for managing structured and unstructured information. Our head office is based in Montreal.                     <p class=\"marketum_blockquote_author\"><span><\/span><\/p>\n                                <\/div>\n        <\/div>\n        \t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9f06c35 elementor-widget elementor-widget-text-editor\" data-id=\"9f06c35\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Coginov&#8217;s Qore platform technology enhances the information value chain, transforming unstructured content into highly contextualized, accessible and valuable information. Coginov&#8217;s solutions enable you to capture, analyze, engage, automate and manage your information assets, with unrivalled accuracy and efficiency.<\/p><p><span class=\"TextRun SCXW238616509 BCX0\" lang=\"FR-CA\" xml:lang=\"FR-CA\" data-contrast=\"none\"><span class=\"NormalTextRun SCXW238616509 BCX0\">Discover our solutions <\/span><a href=\"https:\/\/www.coginov.com\/en\/qoreaudit\/\"><span class=\"NormalTextRun SpellingErrorV2Themed SCXW238616509 BCX0\">QoreAudit<\/span><\/a><span class=\"NormalTextRun SCXW238616509 BCX0\">,\u00a0<\/span><a href=\"https:\/\/www.coginov.com\/en\/qore-ultima-2\/\"><span class=\"NormalTextRun SpellingErrorV2Themed SCXW238616509 BCX0\">QoreUltima<\/span><\/a><span class=\"NormalTextRun SCXW238616509 BCX0\"> and <\/span><a href=\"https:\/\/www.coginov.com\/en\/qoremail\/\"><span class=\"NormalTextRun SpellingErrorV2Themed SCXW238616509 BCX0\">QoreMail<\/span><\/a><\/span><span class=\"EOP SCXW238616509 BCX0\" data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:160,&quot;335559740&quot;:259}\">\u00a0<\/span><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t\n    <div class=\"xs_social_share_widget xs_share_url after_content \t\tmain_content  wslu-style-1 wslu-share-box-shaped wslu-fill-colored wslu-none wslu-share-horizontal wslu-theme-font-no wslu-main_content\">\n\n\t\t\n        <ul>\n\t\t\t        <\/ul>\n    <\/div> \n","protected":false},"excerpt":{"rendered":"<p>As privacy laws are enacted throughout various countries and regions, the ability to uncover relevant tags indicative of personal or sensitive data (often referred to as PII \u2013 personal identifiable information \u2013 and SII \u2013 sensitive identifiable information) is becoming more and more important. <\/p>\n","protected":false},"author":4,"featured_media":7793,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"postBodyCss":"","postBodyMargin":[],"postBodyPadding":[],"postBodyBackground":{"backgroundType":"classic","gradient":""},"footnotes":""},"categories":[111],"tags":[112],"class_list":["post-7782","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-machine-learning-en","tag-machine-learning-en"],"_links":{"self":[{"href":"https:\/\/www.coginov.com\/en\/wp-json\/wp\/v2\/posts\/7782","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.coginov.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.coginov.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.coginov.com\/en\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/www.coginov.com\/en\/wp-json\/wp\/v2\/comments?post=7782"}],"version-history":[{"count":14,"href":"https:\/\/www.coginov.com\/en\/wp-json\/wp\/v2\/posts\/7782\/revisions"}],"predecessor-version":[{"id":7807,"href":"https:\/\/www.coginov.com\/en\/wp-json\/wp\/v2\/posts\/7782\/revisions\/7807"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.coginov.com\/en\/wp-json\/wp\/v2\/media\/7793"}],"wp:attachment":[{"href":"https:\/\/www.coginov.com\/en\/wp-json\/wp\/v2\/media?parent=7782"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.coginov.com\/en\/wp-json\/wp\/v2\/categories?post=7782"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.coginov.com\/en\/wp-json\/wp\/v2\/tags?post=7782"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}