Apache Spark architecture allows a continuous stream of data by dividing the stream into micro-batches called Discretized stream or Dstream, which is an API. Dstream is a sequence of RDDs that are created from input data or from sources such as Kafka, Flume, or by applying operations on other Dstream.