In this era where the global data sphere is expected to reach 175 zettabytes by 2025, it has never been more important for organizations to understand their data as early as possible. It’s not only important to apply traditional “right-side” EDRM phases like Analysis and Review further upstream, but it’s important to fully leverage technology to tame “eDiscovery in the wild”.
Two machine learning tools to leverage in eDiscovery further upstream are sentiment analysis and automated (auto) classification. Let’s look at these two technical approaches and how they facilitate conducting eDiscovery further upstream in to support today’s Big Data challenges.
Sentiment analysis (also known as opinion mining or emotion AI) is the use of natural language processing (NLP) to identify and extract subjective information from source materials. It involves determining the attitude, sentiment, or emotion of a subject based on their spoken or written content. Sentiment analysis can be applied to a wide variety of data, and it can provide an organization with a wealth of information about its customers, products, brand, and even employees.
The simplest form of sentiment analysis is binary classification (positive or negative), but it can also involve more nuanced classifications such as “neutral”, “happy” or “sad”. After the sentiment is determined, the results are interpreted to generate insights. This might involve identifying trends over time, comparing sentiment between different demographic groups and so forth.
Automated data classification utilizes advanced algorithms and machine learning models to effortlessly organize information into predefined classes or categories without the need for manual intervention. This streamlined process uncovers patterns and extracts valuable insights from vast quantities of data rapidly and effectively. This approach commonly employs innovative technologies such as NLP and image recognition, as well as other AI and machine learning methodologies.
The ability to conduct sentiment analysis upstream not only enables organizations to conduct eDiscovery to support use cases such as litigation more efficiently, but conducting sentiment analysis early enough could help avoid litigation altogether! Here are ways in which sentiment analysis can support the discovery process:
Sentiment analysis can also help organizations proactively avoid litigation by regularly analyzing internal communications and flagging instances of negative sentiment. As a result, organizations can identify potential issues before they escalate into legal disputes. This could include workplace harassment, discrimination, or other forms of misconduct.
Not only that, but sentiment analysis can also be used to monitor compliance with internal policies or identify potential sources of dissatisfaction from customers that could lead to litigation. The ability to leverage sentiment analytics to a greater portion of your organization’s corpus than ever is moving this analysis further upstream in the eDiscovery life cycle – to the Identification phase and even Information Governance!
The creation of customizable auto classification rules can also have a significant benefit when applied upstream in eDiscovery or even during Information Governance. For example:
Organizational needs change continually to identify different types of documents or comply with ever-changing laws and regulations, so the ability to customize auto-classification to support those ever-changing needs is paramount.
Getting started with eDiscovery when the case or project begins is too late! So, beginning analysis on the data after collection is beyond late! With Big Data, continually changing regulations and other challenges, organizations must not only move eDiscovery upstream in the EDRM life cycle, but they must also leverage state-of-the-art technologies to conduct eDiscovery upstream as efficiently and cost-effectively as possible. Leveraging machine learning technologies such as sentiment analysis and auto-classification maximizes the benefit of moving eDiscovery upstream!
For more regarding how Veritas Alta™ Classification pairs with the Veritas Data Compliance & Governance Solutions Portfolio to support your storage optimization, eDiscovery, regulatory compliance and data security needs, click here.