ASSIGNMENT 1
Reddit and Twitter, like many large social media websites,
likely uses a combination of different types of databases to
handle its various data storage needs, such as relational
databases like MySQL or PostgreSQL for structured data,
NoSQL databases like MongoDB or Cassandra for
unstructured data, time-series databases like InfluxDB or
OpenTSDB for time-series data, and graph databases like
Neo4j or ArangoDB for relationship data.
Twitter uses a combination of databases to handle the
different types of data they need to store. Some unique
databases they may use include: Distributed databases,
Search engine databases, Stream processing databases.
These databases allow them to scale, handle real-time data
and search through billions of tweets efficiently.
Reddit likely uses a combination of databases to handle the
different types of data they need to store. Some unique
databases they may use include: Distributed databases,
Search engine databases, and Graph databases. These
databases allow them to scale, handle real-time data, search
through billions of posts and comments efficiently and store
and manage data about relationships between users.
Current database systems may have limitations
when it comes to handling unstructured data,
large-scale data, real-time analytics, high write
loads, dynamic schema, distributed data and
multi-model data.
They are optimized for structured data and
transactional processing, and may not be
well-suited for handling big data, high-frequency
data streams, or data across multiple machines.
Some NoSQL databases can handle multiple data
models but traditional relational databases are
limited to only one data model and are not as
flexible as NoSQL databases in this regard.
Describe at least three tables that
might be used to store information
in a social-network/social
mediasystem such as Twitter or
Reddit.
What are the things current database
system cannot do?
6
5
Information Managment
Spring 2023