You're only given
a little spark of madness. You
mustn't lose it. -- Robin Williams
MIT’s The Spark
newsletter recently highlighted a company working on enabling cement production
that produces far less carbon emissions than traditional processes.
Cement, the glue that binds together the aggregates that make concrete the
amazing building material that it has been since ancient Roman engineers began
using it, requires high heat to produce.
Startup Sublime Systems has developed a technique using electrochemistry to
produce cement at a much lower cost to the environment.
The Spark’s article describes the approach.
Equally interesting to me is the progress the company has made from
bench scale to demonstration scale, and their plan to move on to commercial scale.
It is a fascinating look into the challenges faced by any company trying to
move from concept to commercialization.
TIP:
Subscribe to MIT The Spark (https://www.technologyreview.com/
)
Here are excerpts from the article in The Spark …
///////
The Spark
By Casey Crownhart • 07.05.23
Hello hello, welcome back to The Spark!
A few weeks ago, I found myself in a room where fluorescent lights reflected
off the stainless steel tanks lining the walls. The setup reminded me of an
exceedingly high-tech craft brewery.
I wasn’t at a cider tasting, but on a visit to Sublime Systems (https://sublime-systems.com/ ), a
Boston-based startup working to clean up one of the world’s toughest climate
challenges: cement. Today, making cement involves a whole lot of fossil fuels,
and this one material accounts for about 8% of global emissions.
But it might not have to be that way. So for the newsletter this week, come
along with me to see what the startup is up to, and how its process could
change the way we build.
To sum it up briefly, cement is a climate nightmare for two main reasons.
One, the process used to make cement requires super-high temperatures which
today basically means you have to burn fossil fuels in the process. Second,
there are chemical reactions involved in transforming minerals into working
cement, and those release carbon dioxide.
Sublime’s answer is to use electrochemistry. The company’s cofounders, Yet-Ming
Chiang and Leah Ellis, both made their mark in the battery world before turning
to building materials. While at MIT, the duo developed a set of chemical
reactions powered by electricity that can transform minerals into the cement we
know and love today. They cofounded Sublime Systems in 2020.
What I was most interested in during my visit was seeing how the company is taking lab results and transforming them to
work at a much larger scale.
Things started out small: the first time she and a
labmate made cement, it was about the same volume as a single die.
Years later, that small scale is almost inconceivable when you look around the
company’s pilot facility. The ceilings feel dozens of feet high, and I wouldn’t
be able to get my arms around the tanks that line the room.
This facility started up in November 2022, recalls Mike Corbett, Sublime’s head
of engineering. The team moved quickly to build it, going from design to
execution in about nine months.
The company is doing something entirely new by bringing electrochemistry to
cement production. But they’ve been able to leverage technology from other
industries, like mining and chemical production, to find equipment that will
work for what they’re trying to do. “You can usually beg and borrow from other
industries to solve similar technical problems,” Corbett says.
The pilot line is a huge upgrade from the early days, but as Ellis put it, in
the grand scheme of the industry, it’s still “a cement plant for ants.”
The next step for the startup is to build a
demonstration facility producing around 100 tons per day. “That’s the
size where you’re no longer invisible to the cement world,” Ellis says. The
current goal is to have that facility running in 2025. After
that, there’s yet another step: commercial scale, at about a million tons a
year.
///////
TIP:
Google sublime systems cement
Some results …
///////
Electrochemical Synthesis of Low-Carbon Cement
Project Innovation +
Advantages:
Cement is responsible for 8% of global CO2 emissions. Currently, the only
economical way to make Portland cement’s key ingredient, lime, is by thermally
decomposing limestone. This reaction contributes ~75% of cement’s emissions.
Sublime Systems (Sublime) will build an electrochemical system to produce lime
using off-peak renewable electricity and calcium sources that do not release
CO2. The lime produced may possess exceptional purity, consistency, and
reactivity, enabling next-generation low-carbon cements. If successful and
scaled, Sublime’s electrochemical synthesis of lime would reduce energy-related
emissions in the U.S. from lime and cement making while simultaneously
providing ancillary grid services, enabling proliferation of renewables.
https://arpa-e.energy.gov/technologies/projects/electrochemical-synthesis-low-carbon-cement
///////
Online published (draft) 25 OCT 2022
Dr. Jutta Lauf for NATO ENSEC CoE
Is
de-carbonising the construction industry possible? An overview of advances in
materials and processes
Jutta Lauf
Dr. Jutta Lauf was a Research Fellow at the NATO Energy Security Centre of
Excellence from 2020 to
2022.
Corresponding address: NATO ENERGY SECURITY CENTRE OF EXCELLENCE, Research and
Lessons Learned
Division, Šilo g. 5A, LT-10322 Vilnius, Lithuania, NATO Energy Security Centre
of Excellence, info@enseccoe.org
Cement, a key product for construction, is by mass the largest manufactured
product on Earth.
Combined with water and mineral aggregates it forms cement-based materials
(e.g., concrete
and mortar), the second most used substance in the world after water. Cement
based building
materials are energy and cost efficient1, but the globally large scale usage
(4.6 *1012 tons in
2015)1 led to 3% of globally emitted carbon dioxide (CO2) in 20202. Additional
advantages are
the wide availability of the raw materials, a sufficient long period of time
before settling and
its longevity. All these properties make it a versatile material, which is used
in many of NATO’s
infrastructures (Figure 1).
Figure 1: NATO headquarter in Brussel, Blvd Leopold III, 1110 Brussels,
Belgium. It was
constructed as a “Green building” mainly from concrete. Generally the “green”
credentials are
related to the operation of the building, not its construction.4; 3
The traditional form of cement is the so-called ordinary Portland cement (OPC).
The
production process requires grinding and calcining (heating to high temperature
of approx.
1450 °C) a mixture mainly consisting of limestone and clay. The resulting
intermediate
material - known as clinker - is ground to a fine powder with 3–5% gypsum added
to form OPC.
The production of OPC generates on average 842kg CO2 per ton of clinker. Fossil
fuel
combustion is responsible for less than 40% of total CO2 emissions, while
limestone (CaCO3)
decomposition during calcination to calcium oxide (CaO) is responsible for the
remainder5; 1.
In essence, CO2 emissions from clinker production is a mixture of both, an
unavoidable
chemical reaction, and the heating process to start the chemical reaction.
Therefore,
increasing the energy efficiency of clinker production is not sufficient to
significantly reduce
emissions. Carbon capture technologies are necessary to achieve this goal.
Significant
https://enseccoe.org/data/public/uploads/2022/11/emissions-in-construction.pdf
///////
Google® Better!
Jean Steinhardt served as Librarian,
Aramco Americas (https://americas.aramco.com/
), Engineering Division, for 13 years. He now heads Jean Steinhardt Consulting
LLC, producing the same high quality research that he performed for Aramco.
Follow Jean’s blog at: http://desulf.blogspot.com/
for continuing tips on effective online research
Email Jean at jstoneheart@gmail.com with
questions on research, training, or anything else
Not just about desulfurization ... The Blog offers tips & tricks for more effective online research on ANY technology
Friday, July 14, 2023
Inside a high-tech cement laboratory -- The path from concept to commercialization
Friday, July 7, 2023
Pipeline Technology Conference Call for Papers
“Ich bin ein Berliner”
-- John F Kennedy (June 26, 1963)
The 19th
Pipeline Technology Conference, scheduled for 8 - 11 April 2024,
Estrel, Berlin, Germany, has issued a call for papers (https://www.pipeline-conference.com/call-for-papers)
19th Pipeline Technology Conference
Here are the details, from the PTC Conference Web site …
///////
Call for Papers
Interested speakers are invited to submit an abstract (max. 300 words)
describing the main ideas of their paper together with the presenter’s CV (max.
200 words).
Abstracts should not focus on company
presentation but on technical/managerial classifications, R&D, new
technologies or recent case studies.
Your abstract should avoid the use of
a language that is commercial in tone.
Joint presentations between pipeline
operators and technology providers are welcome.
If you have already presented this
abstract in the past, you can still submit it, but you must indicate when and
where it was presented in the past.
Presentations have to be held
in-person in Berlin.
All abstracts will be reviewed in the
ptc Advisory Committee.
Confirmed speakers are requested to
provide a multi-page conference paper for publication in the conference proceedings.
All speakers are invited to join the
exclusive ptc Meetup on 8 April together with the members of the advisory
committee, session chairs, sponsors and exhibitors.
Each speaker gets a presentation time of 20 minutes. All abstracts, papers and
the approved recordings will be published in the ptc Pipeline Open Knowledge
Base.
All speakers have to register for an author ticket via the ticket shop
(confirmed speakers from pipeline operators / municipalities attend
free-of-charge, poster session presenters benefit from a reduced entrance fee).
Technical papers by authors who do not attend the conference in Berlin to
present their papers may be excluded from publication in the ptc Pipeline Open
Knowledge Base.
Conference language: English.
Submit your abstract via our abstract submission system "Ex Ordo"
///////
Google® Better!
Jean Steinhardt served as Librarian,
Aramco Americas (https://americas.aramco.com/
), Engineering Division, for 13 years. He now heads Jean Steinhardt Consulting
LLC, producing the same high quality research that he performed for Aramco.
Follow Jean’s blog at: http://desulf.blogspot.com/
for continuing tips on effective online research
Email Jean at jstoneheart@gmail.com with
questions on research, training, or anything else
Friday, June 30, 2023
Reviews and PDFs
“I never read a
book I must review; it prejudices you so.” -- Oscar Wilde
Actually, I like reviews. Over the years I have developed a number of Google
Scholar alerts that help me keep up with developments in various areas of
interest to me.
One problem with the results of these alerts is that much of those results
reside behind a paywall. So as I browse the alerts that appear in my inbox, I
focus on two things … reviews, and PDFs. In many cases, both are available in
full text at no charge.
TIP:
Create Google Scholar alerts in your field of interest. To save time, look at the
items labeled review or PDF.
Here is a list of titles that have come up as a result of my Google Scholar
alerts over the past few months.
HINT:
If any of the titles seems interesting to you, just copy the title and paste it
into a Google search to find the full text article.
///////
Machine Learning and internet of things
in industry 4.0-A review
Artificial intelligence-based solutions for climate change-a review
Developing an IoT-Based Internal Pipe Inspection Robot Challenges, Solutions
& Future Directions
Effective Tracking of Unknown Clustered Targets Using A Distributed Team of
Mobile Robots
How Blockchain Technology Can Address Circularity and Trace Emission in the
Energy Sector
Utilizing Blockchain Technology for Oil and Gas Industry
A Review of Nanomaterials and their Applications in Oil and Petroleum
Industries
Data Security and Cybersecurity in Saudi Arabia
Identification of Fourth Industrial Revolution technologies using PATSTAT data
Industry 4.0-Features Adopting Digital Technologies in the Oil and Gas Industry
Strengths and Weaknesses of Science and Technology Institutions in Arab
countries
Use Case of Digital Technology Creating Value in Today’s Oil & Gas Industry
by Reducing Cost and Enhancing Drilling Performance
Utilizing Remote Sensing and Data Analytics Techniques to Detect Methane
Emissions from the Oil and Gas Industry and Assist with Sustainability Metrics
Advances in gas-to-liquid technology for environmentally friendly fuel
synthesis-Analytical review of world achievements
Challenges and Opportunities of Low Viscous Biofuel─A Prospective Review
Improving Smart Contracts Management And Designs For Blockchain Systems
Industrial Engineering with Large Language Models-A case study of ChatGPT’s
performance on Oil & Gas problems
Machine Learning Algorithms for Failure Prediction Model and Operational
Reliability of Onshore Gas Transmission Pipelines
Predictive big data analytics for drilling downhole problems-A review
Renewable diesel Production-A Review
Zero programming robotics in additive manufacturing repairs
The+benefits,+Challenges,+and+Future+of+Blockchain+and+The+Internet+of+Things
A Bibliometric Analysis of the role of Industry 4.0 Sensors in Digital
Transformation
A Brief Introduction to Microbial Corrosion in the Oil Industry (2022)
A current review on electron beam assisted additive manufacturing technology
A Review of Advances in Cold Spray Additive Manufacturing
A Review on the Implementation of the BIM Methodology in the Operation
Maintenance and Transport Infrastructure
A Short Review on the Corrosion Behaviour of Wire and Arc Additive Manufactured
Materials
A Systematic Approach to Determine the Cutting Parameters of AM Green Zirconia
in Finish Milling
A Systematic Review of Advanced Sensor Technologies for Non-Destructive Testing
and Structural Health Monitoring
A techno-economic approach for decision-making in metal additive
manufacturing-metal extrusion versus single and multiple laser powder bed
fusion
Accelerating the renewable energy sector through Industry 4.0-Optimization
opportunities in the digital revolution
Additive Manufacturing Processes in Selected Corrosion Resistant Materials A
State of Knowledge Review
Advancing Refining Petrochemicals Integration In The Arabian Gulf
AI-Now-2023-Landscape-Report-Confronting tech power
An assessment of the processing parameters and application of fibre-reinforced
polymers (FRPs) in the petroleum and natural gas industries-A review
Analysis of Hydrogen Production Methods Using the Analytic Hierarchy Process
Application of machine learning and artificial intelligence in oil and gas
industry
Applications of AI in oil and gas projects towards sustainable development-a
systematic literature review
Challenges and recent progress on the application of rapid sand casting for
part production-a review
Changes and improvements in Industry 5.0-A strategic approach to overcome the
challenges of Industry 4.0
Current Downhole Corrosion Control Solutions and Trends in the Oil and Gas
Industry-A Review
Cyber Attacks on Unmanned Aerial Vehicles and Cyber Security Measures
Cyber Security in Blockchain Technology
Deep Learning for Magnetic Flux Leakage Detection and Evaluation of Oil &
Gas Pipelines A Review
Dibenzothiophene Removal from Fuel Oil by Metal-Organic Frameworks Performance
and Kinetics
Digital transformation in asset-intensive organisations-The light and the dark
side
Digital Transformation in Oil and Gas Industry-Opportunities and Challenges
Edge Computing Vs. Cloud Computing-An Overview Of Big Data Challenges And
Opportunities For Large Enterprises
Experimental Investigation on the Wear Performance of Nano-Additives on
Degraded Gear Lubricant
FCC Catalyst Accessibility—A Review
Green Energy Sources Reduce Carbon Footprint of Oil & Gas Industry Processes-A
Review
Green Future Index 2023 (MIT)
Identify blockchain technology strategies in the oil and gas industry
Industrial Decarbonization Roadmap
Industrial internet of things (IIoT) in energy sector
Integration of experimental hydroprocessing and FCC data with process (Aspen
Plus) and refinery optimization (Aspen PIMS) models
Investigation and Implementation of IoT Based Oil & Gas Pipeline Monitoring
System
IoT and Blockchain Integration-Applications, Opportunities, and Challenges
Leveraging Blockchain Technology to Create a More Resilient Supply Chain for
Energy Industry
Life cycle assessment of metal products-A comparison between wire arc additive
manufacturing and CNC milling
Manufacturing of carbon fiber reinforced thermoplastics and its recovery of
carbon fiber-A review
Modeling and Simulation of an Octorotor UAV with Manipulator Arm
Modeling-aided coupling of catalysts, conditions, membranes, and reactors for
efficient hydrogen production from ammonia
Potential Use of Construction Waste for the Production of Geopolymers-A Review
Predictive Deep Learning for Pitting Corrosion Modeling in Buried Transmission
Pipelines
Preparation, characterization, applications and future challenges of
Nanomembrane-A review
Recent Advances, Future Trends, Applications and Challenges of Internet of
Underwater Things (IoUT)-A Comprehensive Review
Refinery 4.0, a Review of the Main Challenges of the Industry 4.0 Paradigm in
Oil & Gas Downstream (2022)
Security threats, countermeasures, and challenges of digital supply chains
Smart Spare Parts Management-A digital supply network perspective
Solar fuel production from hydrogen sulfide-an upstream energy perspective
Suitability assessment of high-power energy storage technologies for offshore
oil and gas platforms-A life cycle cost perspective
Systemic circular economy solutions for fiber reinforced composites
Technological Modernizations in the Industry 5.0 Era-A Descriptive Analysis and
Future Research Directions
///////
Google® Better!
Jean Steinhardt served as Librarian,
Aramco Americas (https://americas.aramco.com/
), Engineering Division, for 13 years. He now heads Jean Steinhardt Consulting
LLC, producing the same high quality research that he performed for Aramco.
Follow Jean’s blog at: http://desulf.blogspot.com/
for continuing tips on effective online research
Email Jean at jstoneheart@gmail.com with
questions on research, training, or anything else
Monday, June 19, 2023
Vive la révolution! The Fourth Industrial Révolution, that is …
The Fourth Industrial Revolution is
a phrase that has been bandied about for the past several years. It suggests
that we are on the verge of fundamental changes in the way we produce the goods
that make life worth living.
But what is the Fourth Industrial Revolution exactly? And how do we identify
the technologies that will make it happen?
Here are two sources that can help us start down that long and winding road.
///////
Identification of Fourth Industrial Revolution technologies
using PATSTAT data
Menéndez de Medina, M.,
Nomaler, Ö., & Verspagen, B. (2023). Identification of Fourth Industrial
Revolution technologies using PATSTAT data. UNU-MERIT. UNU-MERIT Working Papers
No. 023 https://www.merit.unu.edu/publications/wppdf/2023/wp2023-023.pdf
Published: 02/06/2023
Maastricht Economic and social Research institute on Innovation and Technology
(UNU-MERIT)
email: info@merit.unu.edu | website: http://www.merit.unu.edu
Abstract
This document provides a methodological procedure to identify the Fourth
Industrial Revolution technologies using patent data. Attempts to distinguish
these technologies have frequently relied on the European Patent Office (2017,
2020a) methods or have mainly leaned on technical codes and keyword
classifications. Frequently, these studies have the limitation of collecting
technologies arbitrarily and without a deep justification. Only the latest
report from the European Patent Office (EPO, 2020b) attempts to detail the
procedure to recognize the Fourth Industrial Revolution technologies. However,
it does not offer the possibility of being replicated by scholars outside the
organization. This article delivers a procedure to collect Fourth Industrial
Revolution patents relying on key concepts from a detailed literature review -
focused on whether they make up a new revolution and its conceptualization over
timeand the EPO (2020b) report identification method. Subsequently, the
evolution of these technologies and the principal trends are exposed. Finally,
the search queries and the list of identified patents are available (in the
Appendix) to replicate or adapt for other academic purposes.
“4IR technologies are understood in this paper as co-evolving “technological
systems” that combine innovations in the fields of digital data transmission,
smart connected devices, computing, communication and connectivity
technologies, the scope of which goes beyond the manufacturing sector and
includes a broad variety of technological domains such as agriculture, health,
home, infrastructure and services.”
A recent study of Confraria et al. (2021) is not directly focused on the
identification of specific 4IR technologies but on the identification of
emerging technologies.
https://cris.maastrichtuniversity.nl/en/publications/identification-of-fourth-industrial-revolution-technologies-using
///////
Patents and the
Fourth Industrial Revolution: The global technology trends enabling the
data-driven economy (EPO)
This study explains the trends in a variety of digital technologies
like faster wireless internet, smart sensors, “big data” and artificial intelligence.
Taken together these are transforming healthcare, transport, agriculture and
many more sectors – they account for over 10% of all global innovation.
https://www.epo.org/service-support/publications.html?pubid=222#tab3
https://www.epo.org/news-events/in-focus/ict/fourth-industrial-revolution.html
///////
Google® Better!
Jean Steinhardt served as Librarian,
Aramco Americas (https://americas.aramco.com/
), Engineering Division, for 13 years. He now heads Jean Steinhardt Consulting
LLC, producing the same high quality research that he performed for Aramco.
Follow Jean’s blog at: http://desulf.blogspot.com/
for continuing tips on effective online research
Email Jean at jstoneheart@gmail.com with
questions on research, training, or anything else
Tuesday, June 6, 2023
Big Data Analytics … Whaaat?
Whaaat?
To be amazed by something and you
can't believe it is real. Not to be confused with the question
"what." It is an expression of disbelief or amazement. – Urban
Dictionary (https://www.urbandictionary.com
)
I admit to being a little hazy on what “big
data analytics” actually means. An article that popped up in a recent
Google Scholar Alerts search result helped me out on that front.
TIP:
Set up a Google Scholar alert on any topic of interest to you. Results, in my
experience, are about 3 (three) per cent relevant to my specific interests. But
it requires only two minutes of my time to browse the results for the rare
truffles that tantalize my taste buds.
The name of the article … wait for it … is …
Predictive big
data analytics for drilling downhole problems: A review
Do not be misled by the title of the article … It is about so much more than
downhole drilling. In reading the article, I came to understand …
The
goal of big data analytics: To analyze enormous sets of unstructured data in
real time to enable nearly instantaneous decisions.
So, here are excerpts from the article. The full text of the article, by the
way, is available for the low, low price of nothing at: https://www.sciencedirect.com/science/article/pii/S2352484723007710
///////
EXCERPTS
Energy Reports 9 (2023) 5863–5876
Predictive big
data analytics for drilling downhole problems: A review
Aslam Abdullah M. ∗, Aseel A., Rithul Roy, Pranav Sunil
Petroleum Engineering Research Group, School of Chemical Engineering, Vellore
Institute of Technology, Vellore, Tamil Nadu, India
ABSTRACT
With the recent introduction of data recording sensors in exploration, drilling
and production processes, the oil and gas industry has transformed into a
massively data-intensive industry. Big data analytics has acquired a great deal
of interest from researchers to extract and use all the possible information.
This paper presents an outline for predictive big data analytics to forecast
and analyze some downhole problems such as pipe sticking, dog leg and pipe
failure depending on several variables. Different methodologies were studied
under big data, enabling the identification of the paradigm change in data
storage and processing while handling vast diversified data generated in a
short span of life. The evaluated data pattern sets are fed into different
established predictive models and risk prediction windows to highlight future
irregularities for the prevention of accidents. Finally, the game theory is
used to evaluate the best predictive model to discover the optimal model for
the identification of downhole problems.
One of the key phases of the oil and gas industry’s digitization seems to be
big data (BD) analytics. The oil and gas exploration and production industries
now generate enormous datasets on a daily basis as a result of recent
technological advancements.
Big data consists of unstructured (not ordered and text-heavy) and
multi-structured data (containing many data formats arising from interactions
between humans and machines) (Trifu and Ivan, 2016). The phrase big data (also
known as big data analytics or business analytics) identifies the magnitude of
the accessible data collection. There are further properties of the data that
make it suitable for big data applications. IBM appropriately identifies these
features as three V’s. These three V’s refer to volume, variety, and velocity
(Pence, 2014). However, recent publications have added two additional V’s to
provide a detailed explanation of big data. The other Vs consist of veracity
and value (Ishwarappa and Anuradha, 2015).
Big data is not the outcome of a single silver-bullet technology, but rather
the highly complementary combination of several technologies and creative
concepts (Perrons and Jensen, 2015). Despite the fact that this type of
analytics depends on solid data science foundations, there are a number of key
considerations for putting these approaches into effect (Kezunovic et al.,
2020). The storage and processing of large data sets, as well as the transformation
of large data, sets into knowledge, are the primary challenges connected with
big data. It is often believed that the massive amount of big data means that
useful information is hidden and must be unearthed, but analysts cannot simply
intuit the data’s value content (Shull, 2013). In any industry, big data analytics
can give new perspectives. It may result in the accurate identification or
forecasting of new scientific hypotheses, consumer behavior, societal
phenomena, weather patterns, and economic situations (Jayalath et al., 2014).
Table 1 compares traditional data and big data analytics.
5.2. Big data storage and management
The downhole drilling data obtained is an enormous, unstructured, and
complicated data collection that is challenging for conventional data
processing technologies to manage (Chen et al., 2014). The information gathered
by the sensors is multidimensional, and due to the ever-increasing amount of
data being created, quicker and more effective methods of data analysis have become
necessary. Along with the necessary infrastructures for storing and managing
enormous data, there are also specific tools and methodologies for big data
analytics that are essential for making successful judgments at the proper time
(Elgendy and Elragal, 2014). There are techniques and tools which can analyze (process,
decode, and interpret) the operation status and the change in parameters
simultaneously (Kale et al., 2015; Pritchard et al., 2016). The data are
gathered by drilling operators or service providers, and downhole data acquired
from global geographic drilling operations will continually amass and grow in
quantity, ultimately becoming a dataset that exceeds the storage and processing
capacity of a single server (Chen et al., 2014). Multiple distributed servers
are used to store, transmit, and process the collected data before extracting,
transforming, and loading it into different databases for advanced analytics.
These data are very large, ranging in size from Terabytes (TBs) to Petabytes (PBs)
(Meeker and Hong, 2014). Moreover, large data sets may have considerable
variability, hindering the data processing and administration, and varying
integrity as a result of data inconsistency, incompleteness, complexity, delay,
deceit, assumptions, and horizontal scalability to merge dissimilar information
(Chen et al., 2014; Elgendy and Elragal, 2014; Hu et al., 2014). Relational databases,
data marts, and data warehouses are classic techniques for storing and
retrieving structured data.
Several solutions, such as distributed databases and Massive Parallel
Processing (MPP) databases for delivering high query productivity and platform
stability, as well as non-relational databases, were utilized for big data.
Non-relational databases, such as NoSQL, are created to store and manage
non-relational data. NoSQL seeks huge scalability, a flexible data format used
for streamlined creation and deployment of applications. Compared to relational
database systems, NoSQL decouples data storage and management. These databases
emphasize scalable, highperformance data storage and enable data administration
operations to also be done at the application level as opposed to database-specific
languages (Bakshi, 2012).
After storing the data, it has to be analyzed using big data tools and
techniques. According to (He et al., 2011), there are four essential needs for
processing large amounts of data. The foremost prerequisite is rapid data
processing. Due to the fact that disk and internet traffic conflicts with
request performance during performing data, it is vital to minimize the time
necessary for performing data. The next criterion is the speed of query execution.
Many queries are response-time essential due to the demands of high workloads
and real-time requests. As a result, the data placement structure needs to be able
to maintain high query processing rates as the number of inquiries grows
quickly. Consequently, the prerequisite for large-scale data collection is the
efficient use of storage capacity.
Due to limited disk space, it is essential that data storage must be carefully
handled throughout processing and that challenges regarding the storage of the
data should be minimalized. The quick expansion in user behavior might need
extensible storage space and processing speed. The ability to adjust well to
workload patterns that are extremely dynamic is the final need. Massive datasets
are processed by a variety of applications and consumers, for a variety of
purposes in a diverse range of ways, requiring the operating system to be
highly adaptive to unforeseen processing dynamics and not specific to each of
these workload patterns (He et al., 2011).
5.3.1. Challenges in big data analysis
Data mining is the extraction of relevant information and insights from large
datasets using statistical and computational methods. Data mining is an
integral part of big data analytics, which entails processing, analyzing, and
interpreting large and complex datasets to discover patterns, trends, and
insights that can assist organizations in making informed decisions.
Nonetheless, data extraction for big data analytics is not error-free. During the
data collection process, these errors can influence the quality and
dependability of the insights generated from the data (Amirian et al., 2015).
Examples of common data collection errors include:
• Sampling errors: When the sample data used for analysis is not representative
of the population as a whole.
• Measurement errors: When the data collected is not accurate or trustworthy.
• Data entry errors: Errors in data entry occur when information is erroneously
recorded.
• Processing errors: Errors in processing occur when data is improperly
processed.
Data cleansing, data validation, data normalization, and data transformation
are some of the methods used by data mining and analytics practitioners to
reduce the likelihood of these errors. Data cleansing is the process of
identifying and rectifying data errors and inconsistencies, such as absent
values, outliers, and duplicate entries. Validating data involves validating
its accuracy and completeness, such as by ensuring that all required fields are
populated. The normalization of data entails transforming the data into a
standard format, such as converting all dates to a common format. For example,
consider a scenario in which a drilling company wishes to improve the precision
of its drilling operations by analyzing data collected from downhole sensors. The
collected data include measurements of temperature, pressure, and other
parameters that help the company determine the characteristics of the drilled
rock formation. Nevertheless, errors may occur during the data collection
procedure, such as faulty sensors or incomplete data due to technical issues.
These errors can result in erroneous analysis and may lead to drilling in the
incorrect location, resulting in increased costs and decreased efficiency (Liu
et al. 2022).
///////
Google® Better!
Jean Steinhardt served as Librarian,
Aramco Americas (https://americas.aramco.com/
), Engineering Division, for 13 years. He now heads Jean Steinhardt Consulting
LLC, producing the same high quality research that he performed for Aramco.
Follow Jean’s blog at: http://desulf.blogspot.com/
for continuing tips on effective online research
Email Jean at jstoneheart@gmail.com with
questions on research, training, or anything else