ASSIGNMENT 1
Task Management App
Purpose: Manage tasks & projects efficiently
Functions: Task creation, project management,
due date tracking, team collaboration
Design: Homepage dashboard, navigation menu,
task detail page
Budget Tracker
Purpose: Track expenses & create budget plan
Functions: Income/expense tracking, budget
planning, spending analysis, bill reminders
Design: Homepage displaying summary,
navigation menu, pie chart for spending
categorization
Recipe App
Purpose: Collection of recipes, cooking tips &
nutrition information
Functions: Recipe search/collection, meal
planning, nutritional info, cooking tips
Design: Homepage displaying featured content,
navigation menu, recipe detail page with
ingredients, instructions, and nutrition.
1.
2.
3.
Online platforms such as retail, social media and
financial institutions use databases to store and
access data.
They manage customer information, product
details, and order history for efficient search and
display based on various criteria.
Social media platforms also store and access user
information, and financial institutions use
databases to store customer information, account
details, and transaction history.
Propose three applications in
domain projects (e.g. criminology,
economics, brain science, etc.)
Be sure you include:
i. Purpose
ii. Functions
iii. Simple interface design
Name and describe three applications
you have used that employed a
database system to store and access
persistent data. (e.g. airlines, online
trade, banking, university system)
2
1
Information Managment
Spring 2023
ASSIGNMENT 1
NoSQL systems emerged in the 2000s as an
alternative to traditional relational databases,
which were designed to handle structured data
and optimized for transactional processing and
data integrity.
NoSQL systems are designed to handle large-scale,
unstructured, and semi-structured data, with a
more flexible and scalable data model and
optimized for high performance and high
availability.
They differ from traditional relational databases
in terms of scalability, data model, consistency,
and query capabilities. They are designed to
handle big data, horizontally scalable and provide
consistency.
Data mining is needed even when data can be retrieved
efficiently and effectively because data mining goes beyond
simply retrieving data. Data mining involves the process of
discovering hidden patterns and insights from large datasets,
which can be used to inform decision-making, identify
opportunities, and uncover new knowledge.
Retrieving data efficiently and effectively is necessary to access
the raw data that will be mined, but data mining goes beyond
that by applying statistical and machine learning techniques to
extract meaningful information from the data. This includes
tasks such as classification, clustering, association rule mining,
and anomaly detection.
Additionally, data mining helps identify relationships and
dependencies among variables in a dataset that may not be
immediately obvious or easily apparent through simple data
retrieval. This can lead to the discovery of new insights and
knowledge that can inform business decisions, research
questions and other areas of study.
In summary, data mining is needed to extract valuable insights
and knowledge from large datasets that would otherwise
remain hidden, even when data can be retrieved efficiently
and effectively.
Why NoSQL systems emerged in the
2000s? Briefly contrast their features
with traditional database systems.
If data can be retrieved efficiently and
effectively, why data mining is needed?
4
3
Information Managment
Spring 2023
ASSIGNMENT 1
Reddit and Twitter, like many large social media websites,
likely uses a combination of different types of databases to
handle its various data storage needs, such as relational
databases like MySQL or PostgreSQL for structured data,
NoSQL databases like MongoDB or Cassandra for
unstructured data, time-series databases like InfluxDB or
OpenTSDB for time-series data, and graph databases like
Neo4j or ArangoDB for relationship data.
Twitter uses a combination of databases to handle the
different types of data they need to store. Some unique
databases they may use include: Distributed databases,
Search engine databases, Stream processing databases.
These databases allow them to scale, handle real-time data
and search through billions of tweets efficiently.
Reddit likely uses a combination of databases to handle the
different types of data they need to store. Some unique
databases they may use include: Distributed databases,
Search engine databases, and Graph databases. These
databases allow them to scale, handle real-time data, search
through billions of posts and comments efficiently and store
and manage data about relationships between users.
Current database systems may have limitations
when it comes to handling unstructured data,
large-scale data, real-time analytics, high write
loads, dynamic schema, distributed data and
multi-model data.
They are optimized for structured data and
transactional processing, and may not be
well-suited for handling big data, high-frequency
data streams, or data across multiple machines.
Some NoSQL databases can handle multiple data
models but traditional relational databases are
limited to only one data model and are not as
flexible as NoSQL databases in this regard.
Describe at least three tables that
might be used to store information
in a social-network/social
mediasystem such as Twitter or
Reddit.
What are the things current database
system cannot do?
6
5
Information Managment
Spring 2023