September 8, 2020 By Vasfi Gucer 3 min read

You’ve likely heard it before, but it’s worth repeating: more than 80% of all data collected by organizations is not in a standard relational database. Instead, it’s trapped in unstructured documents, social media posts, machine logs, images and other sources. Many organizations struggle to manage this deluge of unstructured data. For example, if you want to use large-scale analytics to gain insights for your business priorities, how are you going to pinpoint and activate the relevant data? Furthermore, how do you go about identifying and classifying sensitive data while removing data that’s redundant and obsolete?

Metadata management software like IBM Spectrum® Discover can help you manage unstructured data by lowering data storage costs, uncovering hidden data value and reducing the risk that comes with massive data stores. Using such a product can enable you to make better business decisions and gain and maintain a competitive advantage.

Metadata management for AI solutions

Today, many businesses are looking for opportunities to take advantage of machine learning, deep learning and other AI technologies. Some of the most common tasks AI performs include:

  • Extracting information from pictures (computer vision)
  • Transcribing or understanding spoken words (speech to text and natural language processing)
  • Pulling insights and patterns out of written text (natural language understanding)
  • Speaking what’s been written (text to speech, natural language processing)
  • Autonomously moving through spaces based on what it senses (robotics)
  • Generally looking for patterns in heaps of data (machine learning)

Real-world examples of these AI solutions include medical imaging data management and “AI Doctors” in the healthcare industry; fraud detection, algorithmic trading and portfolio management in financial services; automated claims handling in the insurance industry; and predictive maintenance and AI-assisted design in the manufacturing industry.

Metadata management solutions like Spectrum Discover are particularly useful to businesses interested in using machine learning to gain more insights from their data. By helping you identify and prepare data for machine learning analysis, the software can help you fast-track your AI projects.

New IBM Redbooks on AI and IBM Spectrum Discover

If you’re interested in learning more about metadata management software, 2 recent IBM Redbooks cover practical AI use cases with IBM Spectrum Discover and other IBM Storage software:

Making Data Smarter with IBM Spectrum Discover: Practical AI Solutions explores 6 use cases for AI solutions using Spectrum Discover in technical depth:

  • Categorizing medical imaging data with content-search capability
  • Extracting metadata from LIDAR imagery with custom applications
  • Organizing training data sets for artificial intelligence
  • Using artificial intelligence in medical imaging – JFR Challenge
  • Data governance use case: Data staging for high-performance processing
  • Data optimization use case: Data migration to tape for cost-efficient archiving

In addition, this book offers a reference architecture on how to design and implement an AI data pipeline using IBM Spectrum Discover.

In the second Redbooks publication, Cataloging Unstructured Data in IBM Watson Knowledge Catalog with IBM Spectrum Discover, you’ll find in-depth use cases from healthcare, life sciences and financial services. This paper explains how IBM Spectrum Discover integrates with the IBM Watson® Knowledge Catalog component of IBM Cloud Pak® for Data. This integration enables storage administrators, data stewards and data scientists to efficiently manage, classify and gain insights from massive amounts of data. The integration improves storage economics, helps mitigate risk and accelerates large-scale analytics to create competitive advantage and speed critical research.

You can explore other technical content at the IBM Redbooks website.

Take advantage of no-cost technical content

I hope you enjoy these books and find their content valuable. We would love to hear from you, so feel free to contact us if you have any questions or comments.
