Big Data Management Solution


Technologies Used


Project Team

Beth Huck

This team is looking for

Product Manager Investor


It is not a point solution: it is designed from the ground up to manage any type of unstructured data, as well as structured data, and relationships between structured and unstructured data. For many decades now a fundamental method for organizing data has been to store it in discrete units called files. In turn files have been organized into directories, or folders. The volume of data, both in terms of the number of files and the size of files makes this increasingly untenable. A new paradigm is needed. While still able to present data as files and directories, Mediaflux is a bridge between the old world of directories and files and the new world of data and metadata. Mediaflux stores data transparently to users, but makes any part of it rapidly discoverable by leveraging the power of metadata. Mediaflux shortens the time between data capture and decision; it is a revolutionary product that can be integrated into existing environments incrementally, with an evolutionary approach. Mediaflux is a multi-user platform for ingesting, storing and discovering any type of data. Mediaflux is not just but also -Search Engine -Geospatial cataloguing system -Digital asset management system -File system for large distributed data -Workflow engine -Data mover and physical management system -Multi-user platform for ingesting and storing any type of data Distributed data -Geospatial data -Combine structured and unstructured data -Optimized for large files and large numbers of files Metadata is the key Describes data in any way desired Automated metadata extraction, or add manually -Supports rapid discovery of the required data -Automated and integrated workflow and/or quality assurance Ingest, store, discover, transform and consume data. Metadata is the key to the rapid discovery of data. Metadata can be: Automatically extracted as data is ingested – for example, geospatial co-ordinates or bounding boxes, images types and resolutions, and text can all be extracted by plug-in content analyzers. Automatically generated – for example, revision histories and audit trails. User generated – existing metadata may be updated or new metadata added manually at any time. Examples are annotations, labels, tags, comments, and workflow-specific actions. Metadata can conform to any standard or to your own customized schema. Metadata can be ingested while invalid with a subsequent workflow to action and correct it. Mediaflux automates processes and workflows, from the most simple to the most complex, to efficiently and effectively manage data through its lifecycle. Examples include (though the possibilities are endless): Scheduling a reminder e-mail notice for a specific time or event A quality assurance process, requiring people to review and sign-off changes to data before it is published Managing transcoding of video into several formats. Different computers might perform different types of transcoding. There may be many computers performing the transcoding – using work lists automatically provides load-balancing. Workflow can be utilized to automate packaging, quality assurance, and analysis processes, or process transitions may be user initiated.