A knowledge storage method optimizing area utilization by storing just one copy of similar knowledge. Subsequent situations of the identical knowledge are changed with a pointer or reference to the unique copy. Contemplate, for instance, a number of digital machines on a server using the identical working system recordsdata. As a substitute of every VM storing an entire copy, this system ensures just one set of OS recordsdata exists, with the opposite VMs referencing this central occasion.
This method reduces storage capability necessities, improves knowledge administration effectivity, and simplifies backup and restoration processes. Traditionally, it turned important as knowledge volumes elevated quickly, necessitating extra environment friendly storage options. Advantages embody decrease storage prices, quicker knowledge replication, and decreased community bandwidth consumption throughout backups.
The next sections will delve deeper into the implementation methods, benefits, and limitations of this storage methodology, exploring its position in trendy knowledge administration infrastructures.
1. House Optimization
House optimization is a central end result and driving drive behind the implementation of information deduplication methods. By eliminating redundant knowledge copies, it immediately addresses the problem of escalating storage calls for in trendy computing environments. This optimization has cascading results on price, effectivity, and useful resource allocation.
-
Elimination of Redundancy
Knowledge deduplication identifies and removes duplicate blocks or recordsdata, retaining solely a single occasion. The next occurrences are changed with references or tips that could this authentic copy. A sensible instance includes a number of digital machine pictures containing similar working system elements. Slightly than storing the total OS for every VM, just one occasion is retained, with the others referencing it, considerably decreasing storage footprint.
-
Capability Effectivity
The instant impression of redundancy elimination is a major enhance in storage capability effectivity. Organizations can retailer extra knowledge inside the identical bodily or digital space for storing. Contemplate a big archive of paperwork with repeated logos and commonplace paragraphs. By storing just one occasion of those components, the storage required for your entire archive is drastically decreased.
-
Decreased Infrastructure Prices
House optimization immediately interprets to decrease infrastructure prices. Much less bodily storage {hardware} is required to accommodate the identical quantity of distinctive knowledge. Moreover, decreased storage utilization results in decreased energy consumption, cooling necessities, and related operational bills. This impact is especially pronounced in massive knowledge facilities the place even incremental enhancements in storage effectivity can yield substantial price financial savings.
-
Enhanced Storage Density
The focus of distinctive knowledge inside a smaller storage footprint will increase storage density. This profit improves useful resource utilization and simplifies storage administration. A high-density storage surroundings streamlines backup and restoration processes, accelerates knowledge replication, and reduces the executive overhead related to managing sprawling storage infrastructure.
The advantages derived from area optimization are integral to the worth proposition. Its direct impression on storage capability, price discount, and operational effectivity solidifies its important position in trendy knowledge administration methods.
2. Knowledge Discount
Knowledge discount, within the context of storage methods, denotes the methods employed to reduce the bodily space for storing required to signify a given set of information. The operational effectivity of information discount is basically tied to the ideas underlying storage that eliminates redundancy.
-
Eliminating Redundant Knowledge Blocks
A core mechanism of information discount includes figuring out and eliminating duplicate knowledge blocks inside a storage system. Equivalent blocks, regardless of their file origin, are saved solely as soon as. Subsequent situations are changed with tips that could the distinctive block. For instance, a number of variations of a presentation might include the identical company emblem picture. Just one copy of the picture is bodily saved, with all variations referencing this single occasion. This reduces the general storage footprint.
-
Compression Algorithms
Compression algorithms are integral to knowledge discount. These algorithms scale back the scale of information by encoding it extra effectively. Knowledge is analyzed for patterns and redundancies, that are then represented utilizing fewer bits. Widespread lossless compression methods, reminiscent of Lempel-Ziv variations, guarantee knowledge integrity throughout compression and decompression. The ensuing knowledge requires much less bodily space for storing.
-
Skinny Provisioning
Skinny provisioning permits storage directors to allocate storage sources on an as-needed foundation, optimizing capability utilization. Storage is provisioned in extra of the bodily accessible space for storing, with the idea that not all allotted storage might be concurrently utilized. As knowledge is written, storage is dynamically allotted, stopping the wasteful pre-allocation of unused area. It reduces the preliminary capital expenditure on storage {hardware}.
-
Knowledge Deduplication Ratios
Knowledge deduplication ratios quantify the effectiveness of information discount methods. Ratios point out the quantity of storage saved relative to the unique knowledge measurement. The next ratio signifies better effectivity. For example, a deduplication ratio of 10:1 signifies that 10 TB of authentic knowledge is decreased to 1 TB of bodily storage. These ratios function key efficiency indicators for evaluating the effectivity of storage methods.
These methods coalesce to reduce the bodily storage required for a given dataset. Knowledge discount practices have change into pivotal in trendy knowledge facilities, enabling organizations to handle escalating knowledge volumes inside finite sources. The appliance of those strategies results in substantial price financial savings, improved storage density, and enhanced operational effectivity.
3. Centralized Repository
A centralized repository is a core element within the profitable implementation of information deduplication. It serves as the only location the place distinctive knowledge blocks or recordsdata are saved, making certain that just one bodily copy of every distinct knowledge component exists inside the storage system. This central location is pivotal because it supplies the purpose of reference for all subsequent occurrences of the identical knowledge. And not using a well-defined and managed centralized repository, the core ideas of avoiding redundancy can’t be successfully utilized. For example, think about an enterprise content material administration system the place quite a few paperwork include the identical firm boilerplate. A centralized repository, facilitated by knowledge deduplication, shops this boilerplate solely as soon as, considerably decreasing total storage consumption. The absence of such a repository would consequence within the wasteful duplication of similar content material throughout quite a few paperwork, negating the advantages of deduplication.
The performance of the centralized repository immediately impacts the effectivity and scalability of information deduplication. Environment friendly indexing and metadata administration are important to rapidly determine and find distinctive knowledge blocks inside the repository. Moreover, knowledge integrity mechanisms, reminiscent of checksums and knowledge validation routines, are important to ensure that the information saved within the repository stays constant and dependable. In eventualities involving catastrophe restoration and enterprise continuity, the centralized repository acts as the first supply for knowledge restoration, making its availability and resilience paramount. For instance, a big monetary establishment consolidating its buying and selling knowledge advantages from decreased replication occasions and extra constant restoration factors, because of a strategically carried out centralized repository.
In abstract, the centralized repository is indispensable for knowledge deduplication. It allows the identification and administration of distinctive knowledge, resulting in area optimization, decreased storage prices, and simplified knowledge administration. Challenges related to repository scalability, efficiency, and knowledge integrity have to be addressed by way of cautious planning and implementation. The effectiveness of information deduplication hinges on the dependable and environment friendly operation of its centralized knowledge retailer, highlighting its basic position in trendy storage architectures.
4. Pointer Redirection
Pointer redirection is a basic mechanism enabling the performance. It types the linchpin in how subsequent situations of similar knowledge entry and make the most of the only bodily copy saved inside a system.
-
The Function of Pointers
Pointers function references, indicating the placement of the only saved occasion of the information. As a substitute of making a reproduction copy, the system generates a pointer that factors on to the unique. This mechanism ensures that every one requests for the information are routed to the only supply, sustaining knowledge consistency. For example, in a virtualized surroundings, a number of digital machines might use the identical working system recordsdata. Slightly than every VM storing these recordsdata independently, they’d every include tips that could a shared central repository.
-
Effectivity in Knowledge Entry
When knowledge is accessed by way of a pointer, the system retrieves it from the only saved occasion. This course of ensures effectivity and consistency. Modifying the unique knowledge mechanically updates the view for all references, reflecting modifications with out requiring updates to a number of copies. Contemplate a database containing quite a few information referencing the identical commonplace textual content block. By storing the textual content block as soon as and utilizing pointers, any updates to the textual content block are instantly seen throughout all referencing information.
-
Metadata Administration
Efficient metadata administration is important for pointer redirection. Metadata consists of data reminiscent of pointer areas, entry permissions, and knowledge integrity checks. This metadata have to be meticulously maintained to make sure dependable knowledge entry and stop knowledge corruption. In eventualities the place knowledge spans a number of storage gadgets, metadata administration turns into important for monitoring knowledge areas and making certain seamless entry.
-
Affect on Storage Techniques
Pointer redirection impacts the general structure and performance of storage methods. Implementing pointer redirection requires cautious consideration of efficiency bottlenecks, knowledge integrity, and system scalability. Storage methods have to be optimized to deal with the elevated overhead related to managing pointers and making certain environment friendly knowledge entry. The advantages of decreased storage consumption have to be balanced in opposition to the added complexity of pointer administration.
These aspects of pointer redirection are central to the effectivity and reliability. The efficient implementation requires cautious design and administration, enabling area optimization and knowledge consistency throughout various storage environments.
5. Price Financial savings
The implementation immediately correlates with tangible price financial savings throughout varied operational dimensions. These reductions stem from the optimized use of storage sources and the following lower in associated overheads. The monetary advantages are a key driver for adoption throughout various organizational settings.
-
Decreased Storage Infrastructure
Eliminating duplicate knowledge reduces the overall storage capability required. A smaller storage footprint interprets on to decreased capital expenditures on bodily or digital storage {hardware}. Fewer disks, storage arrays, or cloud storage subscriptions are wanted, resulting in instant price financial savings. Contemplate a media firm archiving video property. Deduplication minimizes storage necessities for similar footage throughout varied initiatives, considerably decreasing storage infrastructure bills.
-
Decrease Operational Bills
Decreased storage capability necessities additionally yield decrease operational bills. Much less {hardware} interprets to decreased energy consumption, cooling calls for, and upkeep overhead. The executive burden of managing sprawling storage infrastructure can be mitigated. Organizations profit from simplified administration duties and decreased labor prices. For example, a healthcare supplier consolidates affected person information; deduplication reduces storage prices, resulting in decrease electrical energy payments and decreased IT workers hours spent on storage administration.
-
Decreased Backup Prices
Backing up smaller volumes of information reduces the time and sources required for backup and restoration processes. Decreased backup home windows reduce operational disruptions and decrease the price of backup software program and {hardware}. Moreover, much less knowledge to move throughout networks interprets to decreased bandwidth consumption and related prices. A monetary establishment backing up buying and selling knowledge advantages from quicker backup occasions, decrease bandwidth prices, and decreased reliance on costly backup options.
-
Optimized Knowledge Administration
Streamlined knowledge administration processes arising from knowledge deduplication can yield further price financial savings. With much less knowledge to handle, organizations can optimize knowledge lifecycle administration insurance policies, enhance knowledge accessibility, and scale back the danger of information loss or corruption. This optimization interprets into improved operational effectivity and decreased IT infrastructure spending. A authorities company archiving public information advantages from simpler knowledge retrieval, improved compliance, and decrease threat of information breaches, resulting in streamlined operations and decreased prices.
These interlinked price financial savings reinforce the compelling monetary benefits. From decreasing capital expenditures to streamlining operational processes, the financial advantages underscore its worth proposition in trendy knowledge administration methods, resulting in important, measurable enhancements in a company’s monetary efficiency and operational effectivity.
6. Backup Effectivity
Backup effectivity, considerably enhanced, emerges as a important profit ensuing from knowledge deduplication methods. The elimination of redundant knowledge copies immediately interprets into decreased backup storage necessities and accelerated backup completion occasions. The impact is most pronounced in environments characterised by excessive ranges of information redundancy, reminiscent of these containing digital machine pictures, software program distribution packages, or massive doc repositories. Contemplate a company with quite a few digital servers, every containing a near-identical working system and software stack. As a substitute of backing up a number of situations of the identical knowledge, deduplication ensures that solely the distinctive blocks are transferred and saved, minimizing the general backup measurement. This not solely reduces storage consumption but additionally decreases the community bandwidth wanted for backup operations.
The sensible significance of improved backup effectivity extends past storage capability optimization. Shorter backup home windows reduce operational disruptions and be certain that methods can be found for business-critical actions. Decreased backup occasions additionally decrease the danger of information loss within the occasion of a system failure throughout a backup operation. Furthermore, the decreased storage footprint simplifies catastrophe restoration planning, as fewer sources are required to copy and preserve backup copies at offsite areas. An actual-world instance can be a big e-commerce firm performing every day backups of its database. Knowledge deduplication can considerably lower the backup period, enabling the corporate to satisfy stringent restoration time aims (RTOs) and restoration level aims (RPOs) whereas minimizing the impression on on-line gross sales.
In abstract, backup effectivity is a key driver for adopting knowledge deduplication applied sciences. The decreased storage necessities, accelerated backup completion occasions, and simplified catastrophe restoration planning contribute to important operational enhancements and value financial savings. Whereas implementing knowledge deduplication requires cautious planning and consideration of things reminiscent of knowledge locality and storage system structure, the ensuing enhancement in backup effectivity represents a considerable return on funding. The efficient administration of information redundancy is integral to maximizing the worth of backup infrastructure and making certain enterprise continuity.
7. Storage Virtualization
Storage virtualization, a way of abstracting bodily storage sources, typically works along with knowledge deduplication to boost effectivity and manageability inside a storage surroundings. The synergy between these applied sciences optimizes useful resource utilization and simplifies storage administration.
-
Abstraction of Bodily Storage
Storage virtualization creates a logical view of storage sources, decoupling them from the underlying bodily {hardware}. This abstraction permits directors to handle storage as a unified pool, whatever the particular gadgets. Contemplate a situation the place a company makes use of a mixture of storage arrays from completely different distributors. Storage virtualization software program consolidates these disparate sources right into a single digital storage pool, simplifying provisioning and administration. Knowledge deduplication can then be utilized throughout this virtualized pool, maximizing area financial savings. The advantages of every complement one another.
-
Centralized Administration and Provisioning
With storage virtualization, provisioning and administration duties are centralized, decreasing administrative overhead. Directors can dynamically allocate storage sources to functions and customers as wanted, optimizing useful resource utilization. For instance, a cloud service supplier makes use of storage virtualization to effectively provision storage to a number of tenants. Knowledge deduplication additional enhances effectivity by eliminating redundant knowledge throughout tenant environments, enhancing total storage density and decreasing operational prices.
-
Improved Knowledge Mobility and Availability
Storage virtualization allows knowledge mobility, permitting directors to maneuver knowledge between completely different storage tiers or gadgets with out disrupting functions. This flexibility improves knowledge availability and simplifies catastrophe restoration planning. In an enterprise surroundings, knowledge might be seamlessly migrated from high-performance storage to lower-cost archival storage because it ages, optimizing storage prices with out impacting software efficiency. Knowledge deduplication minimizes the quantity of information that must be moved, decreasing migration occasions and bandwidth consumption.
-
Enhanced Storage Effectivity
The mix of storage virtualization and knowledge deduplication leads to considerably enhanced storage effectivity. Virtualization optimizes useful resource allocation, whereas deduplication eliminates redundant knowledge copies. This mix results in decrease storage prices, decreased energy consumption, and simplified storage administration. A big analysis establishment makes use of storage virtualization to consolidate storage sources throughout a number of departments. Implementing knowledge deduplication reduces the general storage footprint, enabling the establishment to retailer extra knowledge inside the identical bodily area.
The mixing of storage virtualization amplifies the advantages of information deduplication by offering a unified and manageable storage infrastructure. This synergy maximizes useful resource utilization, simplifies storage administration, and optimizes storage prices, making it a compelling technique for organizations searching for to enhance the effectivity and agility of their storage environments.
8. Decreased Bandwidth
Bandwidth consumption constitutes a major operational price in knowledge administration. Mitigation of bandwidth utilization by way of the employment of storage efficiencies represents a key profit, significantly in environments characterised by frequent knowledge transfers or distant replication.
-
Environment friendly Knowledge Switch
The elimination of redundant knowledge copies minimizes the amount of information that have to be transmitted throughout replication, backup, or migration processes. A sensible instance includes replicating digital machine pictures to a distant catastrophe restoration website. By making certain that solely distinctive knowledge blocks are transferred, community bandwidth necessities are considerably decreased, resulting in quicker replication occasions and decreased transmission prices. Implications embody improved restoration time aims (RTOs) and decrease operational bills.
-
Optimized Distant Backups
Distant backup operations, which regularly pressure community sources, profit considerably. Solely distinctive knowledge blocks are transmitted to the distant backup website, conserving bandwidth. Contemplate a distributed enterprise backing up knowledge from a number of department workplaces to a central knowledge heart. By implementing knowledge deduplication, the amount of information traversing the extensive space community (WAN) is minimized, optimizing bandwidth utilization and decreasing the danger of community congestion.
-
Streamlined Catastrophe Restoration
Catastrophe restoration processes, which contain replicating massive volumes of information to a secondary website, are accelerated and made extra environment friendly. Smaller knowledge switch sizes translate to shorter replication home windows and improved restoration level aims (RPOs). For example, a monetary establishment replicating its buying and selling knowledge to a catastrophe restoration website can obtain quicker restoration occasions and decreased knowledge loss threat by minimizing the quantity of information requiring transmission.
-
Cloud-Based mostly Storage Effectivity
Using knowledge deduplication in cloud-based storage environments minimizes the bandwidth required for importing and downloading knowledge. This discount is especially invaluable for organizations with restricted or metered bandwidth connections. Contemplate a media firm storing video property within the cloud. By eliminating redundant video segments, the bandwidth wanted for importing and streaming content material is decreased, decreasing cloud storage prices and enhancing content material supply efficiency.
These interlinked aspects spotlight the direct contribution in decreasing bandwidth necessities throughout varied knowledge administration operations. Environment friendly knowledge switch, optimized distant backups, streamlined catastrophe restoration, and cloud-based storage effectivity collectively reveal its capability to alleviate bandwidth constraints, resulting in price financial savings and improved operational efficiency.
9. Simplified Administration
Knowledge deduplication inherently contributes to simplified administration of storage infrastructures. By eliminating redundant knowledge copies, the general quantity of information requiring administration is decreased, resulting in efficiencies in varied administrative duties. This discount has a direct, cascading impact. A smaller dataset requires much less time and fewer sources for backup, replication, and catastrophe restoration operations. The decreased complexity permits IT workers to deal with strategic initiatives quite than being burdened by routine upkeep of sprawling storage environments. A tangible instance is a big hospital system that consolidates affected person information right into a centralized archive. Deduplication minimizes the archive’s bodily measurement, resulting in quicker search and retrieval occasions, streamlined compliance reporting, and decreased administrative overhead in managing retention insurance policies.
The discount in knowledge quantity extends past storage capability, additionally influencing the complexity of information migration and archival processes. With much less knowledge to maneuver or retailer, migration initiatives are accomplished quicker and with fewer disruptions. Archival methods might be extra effectively managed, making certain long-term knowledge preservation and compliance with regulatory necessities. Contemplate a regulation agency managing a big quantity of case recordsdata. Implementing deduplication reduces the scale of the archive, simplifying knowledge retention efforts and making certain that important paperwork might be simply positioned and retrieved when wanted. This immediately interprets to decreased authorized dangers and improved operational effectivity.
In abstract, contributes considerably to simplified administration by decreasing knowledge quantity and complexity. This simplification interprets to decrease administrative overhead, improved operational effectivity, and decreased threat throughout varied knowledge administration duties. Whereas preliminary implementation requires cautious planning and configuration, the long-term advantages of simplified administration make knowledge deduplication an integral part of contemporary knowledge administration methods. The capability to do extra with much less is paramount in at present’s data-intensive environments, and simplification is the important thing to attaining sustainable operational excellence.
Steadily Requested Questions About Knowledge Deduplication
This part addresses frequent inquiries relating to the character, implementation, and advantages of information deduplication. It goals to make clear misconceptions and supply a deeper understanding of the expertise.
Query 1: Is Knowledge Deduplication Appropriate for All Knowledge Varieties?
The efficacy varies relying on the information kind. It’s simplest with knowledge exhibiting excessive ranges of redundancy, reminiscent of digital machine pictures, software program distributions, and archival knowledge. Transactional databases and encrypted knowledge might not yield important advantages as a result of their inherent variability or lack of redundancy.
Query 2: What Affect Does Knowledge Deduplication Have on System Efficiency?
The impression on system efficiency relies on the implementation and workload traits. Inline deduplication, which happens as knowledge is written, can introduce latency. Submit-process deduplication, which happens after knowledge is written, minimizes the preliminary write latency however requires further processing. Cautious planning and efficiency monitoring are important to reduce any unfavourable impression.
Query 3: How Does Knowledge Deduplication Differ From Knowledge Compression?
Knowledge compression reduces knowledge measurement by encoding it extra effectively, whereas eliminating redundant copies of information blocks. Compression is usually utilized on the file stage, whereas operates on the block stage. Each methods can be utilized in conjunction to maximise storage effectivity.
Query 4: What Are the Key Concerns for Implementing Knowledge Deduplication?
Key issues embody deciding on the suitable deduplication methodology (inline vs. post-process), assessing the information sorts and redundancy ranges, planning for ample processing energy and reminiscence, and implementing sturdy knowledge integrity mechanisms. Correct planning is essential for profitable implementation.
Query 5: Does Knowledge Deduplication Assure Knowledge Integrity?
Whereas considerably reduces storage redundancy, it doesn’t inherently assure knowledge integrity. Strong knowledge integrity mechanisms, reminiscent of checksums, knowledge validation routines, and redundant storage arrays, must be carried out to guard in opposition to knowledge corruption or loss. A complete knowledge safety technique is important.
Query 6: Can Knowledge Deduplication Be Used With Cloud Storage?
Sure, knowledge deduplication might be employed with cloud storage options to reduce storage prices and optimize bandwidth utilization. Cloud suppliers might supply built-in knowledge deduplication options, or third-party options can be utilized. It’s important to guage the compatibility and integration of options with the particular cloud platform.
Efficient utilization hinges on a transparent understanding of its capabilities, limitations, and the particular necessities of the storage surroundings. Cautious planning and diligent monitoring are important for maximizing its advantages whereas minimizing potential dangers.
The next sections will discover real-world use instances and implementation methods in better element.
Implementing Knowledge Deduplication
The efficient implementation of information deduplication necessitates cautious planning and meticulous execution. The next ideas supply steerage to make sure profitable integration and optimum efficiency.
Tip 1: Conduct a Thorough Knowledge Evaluation: Earlier than implementing knowledge deduplication, conduct a complete evaluation of the information panorama. Determine knowledge sorts with excessive redundancy, assess the general storage capability, and decide the frequency of information entry. The evaluation ought to inform the collection of applicable deduplication strategies and storage configurations.
Tip 2: Select the Proper Deduplication Technique: Choose a deduplication methodology that aligns with particular workload necessities. Inline deduplication can scale back storage capability instantly however might introduce latency, whereas post-process deduplication minimizes preliminary write latency however requires further processing sources. The selection relies on the stability between efficiency and storage effectivity.
Tip 3: Plan for Ample Processing Energy and Reminiscence: Deduplication requires important processing energy and reminiscence. Be certain that the storage infrastructure has ample sources to deal with the computational overhead related to knowledge evaluation and metadata administration. Underestimating useful resource necessities can result in efficiency bottlenecks and decreased storage effectivity.
Tip 4: Implement Strong Knowledge Integrity Mechanisms: Whereas reduces storage redundancy, it doesn’t inherently assure knowledge integrity. Implement sturdy knowledge integrity mechanisms, reminiscent of checksums, knowledge validation routines, and redundant storage arrays, to guard in opposition to knowledge corruption or loss. Common knowledge integrity checks are important.
Tip 5: Monitor Efficiency and Regulate Settings: Repeatedly monitor the efficiency of the storage system after implementing knowledge deduplication. Observe key metrics reminiscent of storage capability, deduplication ratios, and entry latency. Regulate deduplication settings as wanted to optimize efficiency and storage effectivity.
Tip 6: Consider Cloud-Based mostly Options: When utilizing cloud storage, assess the supply of native or third-party options. Contemplate components reminiscent of price, efficiency, integration with current methods, and compliance necessities. Cloud-based implementations can present scalable and cost-effective deduplication capabilities.
Tip 7: Plan for Catastrophe Restoration: Be certain that the catastrophe restoration plan accounts for. Confirm that backup and replication processes are suitable with deduplicated knowledge and that restoration time aims (RTOs) might be met. Common catastrophe restoration drills are important to validate the effectiveness of the plan.
Correct implementation is important for realizing its advantages. Adhering to those tips can guarantee environment friendly and dependable administration.
The next part will summarize key takeaways and supply concluding remarks.
Conclusion
This exploration of single occasion storage has illuminated its core perform: the elimination of information redundancy to optimize storage capability. This method, achieved by way of the retention of a single knowledge occasion and the usage of pointers for subsequent copies, provides substantial advantages. Decreased storage prices, improved backup effectivity, and simplified knowledge administration signify key benefits realized by way of its implementation.
The ideas and practices detailed herein present a foundational understanding for efficient deployment and utilization. Organizations are inspired to guage single occasion storage as a method to boost their knowledge administration methods, thereby maximizing useful resource effectivity and minimizing operational bills in an surroundings of escalating knowledge volumes.