Unstructured data provides equal risk and opportunities for businesses

26 October 2020
Data Science Foundation

Unstructured data is projected to account for approximately 80% of the data that enterprises will process on a daily basis by 2025. Data breaches and other security issues get a lot of attention in the media, but all businesses working with data, especially data in the cloud, are at risk of data loss. Preventing data loss can be difficult for a number of reasons.

IDC projects that by 2025, there will be 163 zettabytes of data in the world. To put that in context, one zettabyte is equal to a thousand exabytes, a billion terabytes, or a trillion gigabytes. The astronomical amount of data moving through and residing in the cloud is just one of the complications that make securing data a tough task for businesses to manage. Most of the unstructured data in the world goes completely unused. According to industry analysts IDC, more than 90% of unstructured data is never examined. This means large portions of data float around unsecured and underutilized for many businesses.
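To make the scale concrete, the unit conversion above can be checked with a few lines of Python. This is only an illustrative sketch that assumes decimal (SI) prefixes; the `UNITS` table and the `convert` helper are names invented for this example.

```python
# Decimal (SI) storage units expressed in bytes.
UNITS = {
    "gigabyte": 10**9,
    "terabyte": 10**12,
    "exabyte": 10**18,
    "zettabyte": 10**21,
}


def convert(amount, from_unit, to_unit):
    """Convert a whole-number amount between two units listed in UNITS."""
    return amount * UNITS[from_unit] // UNITS[to_unit]


# One zettabyte expressed in each of the smaller units.
for unit in ("exabyte", "terabyte", "gigabyte"):
    print(f"1 zettabyte = {convert(1, 'zettabyte', unit):,} {unit}s")

# The projected 163 zettabytes, expressed in terabytes.
print(f"163 zettabytes = {convert(163, 'zettabyte', 'terabyte'):,} terabytes")
```

Integer arithmetic is used so that the very large values stay exact rather than being rounded as floating-point numbers.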

That’s why it’s important to understand where unstructured data comes from, why it’s so hard to pin down, the risks of not securing unstructured data, and the rewards of bringing that data into a structured environment.

Hiding in plain sight

Unstructured data can come from almost any source. Nearly every asset or piece of content created or shared by a device in the cloud carries unstructured data. This can include:

  • Product demo videos on your website
  • QR codes for discounts and deals on an e-commerce app
  • Podcasts and other audio blogging files hosted on your website’s blog page
  • Social media messages on platforms like Facebook, Twitter, and LinkedIn

Internal communications and collaboration platforms are major sources of unstructured data. Think Slack, Confluence, and other SaaS applications where many people do their daily work and communicate with colleagues. Most cloud-based applications like these allow unstructured data to pass through massive networks to be shared, copied, accessed and stored unprotected.

IDG Communications published an article written by then-Pitney Bowes Software Vice President Andy Berry in 2018. Berry commented on how the modern workplace approaches data and why these norms contribute to the data loss problem, citing one study that found enterprises using almost 500 unique business applications. SaaS applications generate data that can quickly become obsolete, unusable, and eventually inaccessible.

Data powers everything we do in our professional and personal lives, but with little to no oversight of data hygiene, we often miss key opportunities to close security blind spots and maximize data performance.

A complex problem

The various sources of unstructured data show how complex data loss can be. Many problems with data loss prevention (DLP) start with the three V’s of data: volume, velocity, and variety. It’s hard for manual review to keep up with the staggering amount of data, the speed at which it proliferates, and the many different sources it comes from.

Adding to the problem is the fact that unstructured data is very difficult to organize. It’s impossible to dump every piece of unstructured information into a database or spreadsheet, because that data comes from myriad different sources and likely doesn’t follow similar formatting rules. On top of that, finding unstructured data through manual processes would take more time than there are hours in the day. It’s not a job for humans.

Other roadblocks to unstructured data collection include increasingly stringent privacy regimes, laws that protect intellectual property (IP) and other confidential or proprietary information like trade secrets, and businesses communicating across different security domains between the cloud and traditional hard-drive-based storage systems. Information security is evolving at lightning speed, but some schools of thought are still based on older priorities that focus on preventing outsider threats. It’s important to protect an organization from malicious actors, but what about good-natured, everyday workers who don’t know what they don’t know? Their mistakes can still hurt an organization in tremendous ways.

Unstructured data isn’t all bad news. It can also be an opportunity for organizations that can recognize two main ideas. First, that this data must be gathered, protected, and understood. Second, that there’s value in all the data that is currently going unused. Computer Weekly cited sources that estimate modern businesses are utilizing as little as 1% of their unstructured data.

Our world runs on data, and each person interacting with apps, platforms, and devices contributes to the growing data reserves. When organizations think about gathering data to help with marketing, business intelligence, and other key functions, they must also factor in the impact of unstructured data. Unstructured data presents equal risk and opportunity for business leaders. When that data lives in the darkness, its only impacts are negative. But when data is brought into the light, we can use that data to be smarter and better at work.

Solving the unstructured data problem

Unstructured data is a major concern for organizations using cloud-based collaboration and communications platforms. Productivity relies on environments where co-workers can share ideas and messages quickly, without fear of exposing sensitive data. Nightfall, a data loss prevention (DLP) solution, provides much-needed security for today’s most used communications and collaboration platforms like Slack, Confluence, and many other popular SaaS & data infrastructure products.

Since these applications lack an internal DLP function, and each allows for the lightning-fast transmission of massive amounts of data, Nightfall’s machine learning-based platform is an essential partner for many organizations handling sensitive information like PII (personally identifiable information), PHI (protected health information), and other business-critical secrets. Nightfall’s three-step approach allows businesses to discover, classify, and protect unstructured data through artificial intelligence (AI) and machine learning (ML). Our solution makes sense of unstructured data, while traditional security solutions solely rely on users to help categorize data through methods like regular expressions (regex), which have limited accuracy in unstructured environments.
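The limited accuracy of regex in unstructured text can be illustrated with a small Python sketch. This is not Nightfall's method; the pattern, the sample messages, and the helper names are all assumptions made up for this example. A naive 16-digit pattern flags a tracking number just as readily as a card number, and even adding a Luhn checksum only filters candidates by arithmetic, with no understanding of context.

```python
import re

# Naive pattern: 16 digits, optionally split into groups of four by spaces or hyphens.
CARD_PATTERN = re.compile(r"\b\d{4}[ -]?\d{4}[ -]?\d{4}[ -]?\d{4}\b")


def luhn_valid(number):
    """Return True if the digits in `number` pass the Luhn checksum."""
    digits = [int(ch) for ch in number if ch.isdigit()]
    total = 0
    for index, digit in enumerate(reversed(digits)):
        if index % 2 == 1:  # double every second digit, counting from the right
            digit *= 2
            if digit > 9:
                digit -= 9
        total += digit
    return total % 10 == 0


def regex_hits(text):
    """Every substring that merely looks like a 16-digit card number."""
    return CARD_PATTERN.findall(text)


def checked_hits(text):
    """Regex hits that also pass the Luhn checksum."""
    return [hit for hit in regex_hits(text) if luhn_valid(hit)]


# Hypothetical chat messages; 4111 1111 1111 1111 is a widely used test card number.
messages = [
    "Customer card is 4111 1111 1111 1111, please refund",
    "Tracking number 1234567812345678 shipped today",
    "Call me at 555-0100",
]

for message in messages:
    print(message)
    print("  regex only:", regex_hits(message))
    print("  regex + Luhn:", checked_hits(message))
```

In this sample the bare pattern flags the tracking number as well as the card number, while the checksum step rejects the tracking number. A 16-digit identifier that happens to satisfy the checksum would still be flagged, which is the kind of gap that context-aware classification is meant to close.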

Each step of Nightfall’s ML solution is critical to the process of DLP. Discover means continuous monitoring of sensitive data flowing into and out of all the services you use. Classify means ML classifies your sensitive data and PII automatically, so nothing gets missed. Protect means businesses can set up automated workflows for quarantines, deletions, alerts, and more. These three arms of DLP save you time and keep your business safe, all with minimal manual review or oversight from you or your staff.
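The general shape of a discover, classify, and protect workflow can be sketched in Python. This is a hypothetical illustration, not Nightfall's API or implementation: the channel names, the `Finding` type, and every function here are invented for the example, simple regex detectors stand in for the trained ML classifiers the article describes, and redaction is just one assumed protective action among the quarantines, deletions, and alerts mentioned above.

```python
import re
from dataclasses import dataclass


@dataclass
class Finding:
    source: str  # where the text was found
    label: str   # kind of sensitive data
    text: str    # the matched span


# Stand-in detectors (assumption): a real system would use trained models here.
DETECTORS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "US_SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def discover(channels):
    """Step 1: walk every monitored channel and yield (source, text) pairs."""
    for source, texts in channels.items():
        for text in texts:
            yield source, text


def classify(source, text):
    """Step 2: label each sensitive span found in one piece of text."""
    findings = []
    for label, pattern in DETECTORS.items():
        for match in pattern.finditer(text):
            findings.append(Finding(source, label, match.group()))
    return findings


def protect(text, findings):
    """Step 3: apply a protective action; here, redact each finding."""
    for finding in findings:
        text = text.replace(finding.text, f"[{finding.label} REDACTED]")
    return text


def run_pipeline(channels):
    """Run all three steps and return (source, protected_text, findings) tuples."""
    results = []
    for source, text in discover(channels):
        findings = classify(source, text)
        results.append((source, protect(text, findings), findings))
    return results


# Hypothetical messages from two collaboration tools.
channels = {
    "slack:#support": ["Customer SSN is 123-45-6789, please verify", "Lunch at noon?"],
    "confluence:onboarding": ["Contact jane.doe@example.com for access"],
}

for source, protected, findings in run_pipeline(channels):
    print(f"{source}: {protected} ({len(findings)} finding(s))")
```

Keeping the three steps as separate functions means the classifier can be swapped for a more accurate model, or the protective action changed from redaction to an alert, without touching the other stages.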

Helping businesses identify and access unstructured data

Data is a part of life, especially as remote work becomes an essential function for productivity and collaboration. Business leaders must understand the risk of ignoring unstructured data and the value of making that data work for the business. It’s a tall order to identify and bring in a mass of unknown data to the cloud, but the rewards come with a better understanding of your organization, your industry, and your customers. Good things can come from unstructured data — as long as you’re ready to approach the issue with a solid data strategy and a knowledgeable DLP partner like Nightfall.

--------------------------------------------------------------------------------------------------------------

About Nightfall

Nightfall is the industry’s first cloud-native DLP platform that discovers, classifies, and protects data via machine learning. Nightfall is designed to work with popular SaaS applications like Slack & GitHub as well as IaaS platforms like AWS. You can schedule a demo with us below to see the Nightfall platform in action.

“This article was originally posted on Nightfall.ai”
