Better security through math

Statistical databases (SDBs) are collections of data used to gather and analyze information from a variety of sources. The data may come from plant floor computations, sales transactions, customer files, voter registrations, medical records, employee rosters, product inventories, or other compilations of facts and figures.


ISSSourceStatistical databases (SDBs) are collections of data used to gather and analyze information from a variety of sources. The data may come from plant floor computations, sales transactions, customer files, voter registrations, medical records, employee rosters, product inventories, or other compilations of facts and figures.

Because database security requires multiple processes and controls, it presents huge challenges to organizations. With the computerization of databases in manufacturing, healthcare, forensics, telecommunications, and other fields, ensuring this kind of security has become important and will become even more crucial in the coming years.

Along those lines, there is now a security-control model for statistical databases.

“Providing privacy and confidentiality in SDBs is not a new issue,” said Harout Aydinian of the Bielefeld University in Germany who wrote a paper on the subject with the late Rudolf Ahlswede. “Privacy interests have evolved from the very first census in the United States. Recorded protests until the mid-20th century reflect constitutional issues resulting from the requirement for U.S. residents to provide sensitive personal information. Questions on census forms about diseases, mortgage values, and other items have raised many concerns.”

While such databases are very helpful in aggregating data, there is a risk that confidential information about an individual’s record may end up deliberately compromised.

“Since such data sets also contain sensitive information, such as the disease of an individual, or the salary of an employee, it is necessary to provide security against the disclosure of confidential information,” Aydinian said. “Even in cases where a user has no direct access to sensitive information, sometimes confidential data about an individual can be inferred by correlating enough statistics.”

Typically, statistical databases only accept queries that involve specific statistical functions (such as sum, average, count, min, max, etc.). However, the use of these queries may render databases susceptible to compromise. For instance, it may be possible to infer information about specific individuals by putting together data from a sequence of statistical queries, using prior knowledge of an individual, or through collusion among users.

An SDB is secure if no one can infer protected data from available queries.

“In the literature, many scenarios of compromise and inference control methods have been proposed to protect SDBs,” Aydinian said. “However, to date no one security control method is capable of completely preventing compromise.”

Query restriction is one of several general approaches used for security control. A “query request” retrieves a subset of data from a database that meets a set of conditions. In query restriction, there are limits on the kind and amount of data you can retrieve.

In one type of query restriction method, only certain sums of individual records (called “SUM queries”) that meet a minimum specified size or number, and satisfy a specified set of conditions, are available to users.

“Consider a company with a large number of employees,” Aydinian said. “Suppose that for each member of the company, the sex, age, rank, length of employment, salary etc. is recorded. The salaries of individual employees are confidential. Suppose that only SUM queries are allowed, i.e. the sum of the salaries of the specified people is returned. Then one might pose the query: What is the sum of salaries for males, above 50, and during the last 10 years?”

The researchers provide an optimal collection of SUM queries that prevents compromise of confidential information — such as individual salaries, for instance. A natural solution is to maximize the number of available SUM queries. The authors obtain tight bounds for the maximum number of such queries that return subsets of data without compromising groups of entries.

“Future work in the query-restriction approach includes evaluation of new security-control mechanisms, which are easy to implement and guarantee absolute security,” Aydinian said. “At the same time, it is desirable that these methods satisfy other criteria like richness of available queries, consistency, cost etc. It also seems promising to develop methods combining different security control mechanisms.”

No comments
The Engineers' Choice Awards highlight some of the best new control, instrumentation and automation products as chosen by...
The System Integrator Giants program lists the top 100 system integrators among companies listed in CFE Media's Global System Integrator Database.
The Engineering Leaders Under 40 program identifies and gives recognition to young engineers who...
This eGuide illustrates solutions, applications and benefits of machine vision systems.
Learn how to increase device reliability in harsh environments and decrease unplanned system downtime.
This eGuide contains a series of articles and videos that considers theoretical and practical; immediate needs and a look into the future.
Choosing controllers: PLCs, PACs, IPCs, DCS? What's best for your application?; Wireless trends; Design, integration; Manufacturing Day; Product Exclusive
Variable speed drives: Smooth, efficient, electrically quite motion control; Process control upgrades; Mobile intelligence; Product finalists: Vote now; Product Exclusives
Machine design tips: Pneumatic or electric; Software upgrades; Ethernet advantages; Additive manufacturing; Engineering Leaders; Product exclusives: PLC, HMI, IO
This article collection contains the 5 most referenced articles on improving the use of PID.
Learn how Industry 4.0 adds supply chain efficiency, optimizes pricing, improves quality, and more.

Find and connect with the most suitable service provider for your unique application. Start searching the Global System Integrator Database Now!

Cyber security cost-efficient for industrial control systems; Extracting full value from operational data; Managing cyber security risks
Drilling for Big Data: Managing the flow of information; Big data drilldown series: Challenge and opportunity; OT to IT: Creating a circle of improvement; Industry loses best workers, again
Pipeline vulnerabilities? Securing hydrocarbon transit; Predictive analytics hit the mainstream; Dirty pipelines decrease flow, production—pig your line; Ensuring pipeline physical and cyber security