edu.berkeley.cs.succinct.impl

SuccinctRDDImpl

class SuccinctRDDImpl extends SuccinctRDD

Implementation of SuccinctRDD.

Linear Supertypes
SuccinctRDD, RDD[Array[Byte]], Logging, Serializable, Serializable, AnyRef, Any

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. def ++(other: RDD[Array[Byte]]): RDD[Array[Byte]]

    Definition Classes
    RDD
  5. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  6. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  7. def aggregate[U](zeroValue: U)(seqOp: (U, Array[Byte]) ⇒ U, combOp: (U, U) ⇒ U)(implicit arg0: ClassTag[U]): U

    Definition Classes
    RDD
  8. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  9. def cache(): SuccinctRDDImpl.this.type

    Persists the Succinct partitions at targetStorageLevel, which defaults to MEMORY_ONLY.

    Definition Classes
    SuccinctRDDImpl → RDD
  10. def cartesian[U](other: RDD[U])(implicit arg0: ClassTag[U]): RDD[(Array[Byte], U)]

    Definition Classes
    RDD
  11. def checkpoint(): Unit

    Definition Classes
    RDD
  12. def clearDependencies(): Unit

    Attributes
    protected
    Definition Classes
    RDD
  13. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  14. def coalesce(numPartitions: Int, shuffle: Boolean)(implicit ord: Ordering[Array[Byte]]): RDD[Array[Byte]]

    Definition Classes
    RDD
  15. def collect[U](f: PartialFunction[Array[Byte], U])(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
  16. def collect(): Array[Array[Byte]]

    Definition Classes
    RDD
  17. def compute(split: Partition, context: TaskContext): Iterator[Array[Byte]]

    Overrides the compute function to return an iterator over Succinct records.

    Definition Classes
    SuccinctRDD → RDD
  18. def context: SparkContext

    Definition Classes
    RDD
  19. def count(): Long

    Counts the number of records in the SuccinctRDD.

    returns

    The number of records in the SuccinctRDD.

    Definition Classes
    SuccinctRDD → RDD
  20. def countApprox(timeout: Long, confidence: Double): PartialResult[BoundedDouble]

    Definition Classes
    RDD
    Annotations
    @Experimental()
  21. def countApproxDistinct(relativeSD: Double): Long

    Definition Classes
    RDD
  22. def countApproxDistinct(p: Int, sp: Int): Long

    Definition Classes
    RDD
    Annotations
    @Experimental()
  23. def countByValue()(implicit ord: Ordering[Array[Byte]]): Map[Array[Byte], Long]

    Definition Classes
    RDD
  24. def countByValueApprox(timeout: Long, confidence: Double)(implicit ord: Ordering[Array[Byte]]): PartialResult[Map[Array[Byte], BoundedDouble]]

    Definition Classes
    RDD
    Annotations
    @Experimental()
  25. def countOffsets(query: String): Long

    Counts all occurrences of a query in the RDD.

    query

    The count query.

    returns

    The count of the number of occurrences of the query.

    Definition Classes
    SuccinctRDD
  26. def countOffsets(query: Array[Byte]): Long

    Counts all occurrences of a query in the RDD.

    query

    The count query.

    returns

    The count of the number of occurrences of the query.

    Definition Classes
    SuccinctRDD
  27. final def dependencies: Seq[Dependency[_]]

    Definition Classes
    RDD
  28. def distinct(): RDD[Array[Byte]]

    Definition Classes
    RDD
  29. def distinct(numPartitions: Int)(implicit ord: Ordering[Array[Byte]]): RDD[Array[Byte]]

    Definition Classes
    RDD
  30. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  31. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  32. def extract(offset: Long, length: Int): Array[Byte]

    Extracts data from the RDD.

    offset

    Offset into original RDD.

    length

    Number of bytes to be fetched.

    returns

    The extracted data.

    Definition Classes
    SuccinctRDDImpl → SuccinctRDD
  33. def filter(f: (Array[Byte]) ⇒ Boolean): RDD[Array[Byte]]

    Definition Classes
    RDD
  34. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  35. def first(): Array[Byte]

    Definition Classes
    RDD
  36. def firstParent[U](implicit arg0: ClassTag[U]): RDD[U]

    Attributes
    protected[org.apache.spark]
    Definition Classes
    RDD
  37. def flatMap[U](f: (Array[Byte]) ⇒ TraversableOnce[U])(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
  38. def fold(zeroValue: Array[Byte])(op: (Array[Byte], Array[Byte]) ⇒ Array[Byte]): Array[Byte]

    Definition Classes
    RDD
  39. def foreach(f: (Array[Byte]) ⇒ Unit): Unit

    Definition Classes
    RDD
  40. def foreachPartition(f: (Iterator[Array[Byte]]) ⇒ Unit): Unit

    Definition Classes
    RDD
  41. def getCheckpointFile: Option[String]

    Definition Classes
    RDD
  42. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  43. def getDependencies: Seq[Dependency[_]]

    Attributes
    protected
    Definition Classes
    RDD
  44. def getFirstParent: RDD[SuccinctPartition]

    Returns the first parent of the RDD.

    returns

    The first parent of the RDD.

    Attributes
    protected[edu.berkeley.cs.succinct]
    Definition Classes
    SuccinctRDD
  45. def getPartitions: Array[Partition]

    Returns the array of partitions.

    returns

    The array of partitions.

    Attributes
    protected
    Definition Classes
    SuccinctRDD → RDD
  46. def getPreferredLocations(split: Partition): Seq[String]

    Attributes
    protected
    Definition Classes
    RDD
  47. def getStorageLevel: StorageLevel

    Definition Classes
    RDD
  48. def glom(): RDD[Array[Array[Byte]]]

    Definition Classes
    RDD
  49. def groupBy[K](f: (Array[Byte]) ⇒ K, p: Partitioner)(implicit kt: ClassTag[K], ord: Ordering[K]): RDD[(K, Iterable[Array[Byte]])]

    Definition Classes
    RDD
  50. def groupBy[K](f: (Array[Byte]) ⇒ K, numPartitions: Int)(implicit kt: ClassTag[K]): RDD[(K, Iterable[Array[Byte]])]

    Definition Classes
    RDD
  51. def groupBy[K](f: (Array[Byte]) ⇒ K)(implicit kt: ClassTag[K]): RDD[(K, Iterable[Array[Byte]])]

    Definition Classes
    RDD
  52. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  53. val id: Int

    Definition Classes
    RDD
  54. def intersection(other: RDD[Array[Byte]], numPartitions: Int): RDD[Array[Byte]]

    Definition Classes
    RDD
  55. def intersection(other: RDD[Array[Byte]], partitioner: Partitioner)(implicit ord: Ordering[Array[Byte]]): RDD[Array[Byte]]

    Definition Classes
    RDD
  56. def intersection(other: RDD[Array[Byte]]): RDD[Array[Byte]]

    Definition Classes
    RDD
  57. def isCheckpointed: Boolean

    Definition Classes
    RDD
  58. def isEmpty(): Boolean

    Definition Classes
    RDD
  59. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  60. def isTraceEnabled(): Boolean

    Attributes
    protected
    Definition Classes
    Logging
  61. final def iterator(split: Partition, context: TaskContext): Iterator[Array[Byte]]

    Definition Classes
    RDD
  62. def keyBy[K](f: (Array[Byte]) ⇒ K): RDD[(K, Array[Byte])]

    Definition Classes
    RDD
  63. def log: Logger

    Attributes
    protected
    Definition Classes
    Logging
  64. def logDebug(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  65. def logDebug(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  66. def logError(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  67. def logError(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  68. def logInfo(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  69. def logInfo(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  70. def logName: String

    Attributes
    protected
    Definition Classes
    Logging
  71. def logTrace(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  72. def logTrace(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  73. def logWarning(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  74. def logWarning(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  75. def map[U](f: (Array[Byte]) ⇒ U)(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
  76. def mapPartitions[U](f: (Iterator[Array[Byte]]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
  77. def mapPartitionsWithIndex[U](f: (Int, Iterator[Array[Byte]]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
  78. def max()(implicit ord: Ordering[Array[Byte]]): Array[Byte]

    Definition Classes
    RDD
  79. def min()(implicit ord: Ordering[Array[Byte]]): Array[Byte]

    Definition Classes
    RDD
  80. var name: String

    Definition Classes
    RDD
  81. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  82. final def notify(): Unit

    Definition Classes
    AnyRef
  83. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  84. def parent[U](j: Int)(implicit arg0: ClassTag[U]): RDD[U]

    Attributes
    protected[org.apache.spark]
    Definition Classes
    RDD
  85. val partitionOffsetRanges: Array[Range]

  86. val partitioner: Option[Partitioner]

    Definition Classes
    RDD
  87. final def partitions: Array[Partition]

    Definition Classes
    RDD
  88. val partitionsRDD: RDD[SuccinctPartition]

    The input RDD of partitions.

    Definition Classes
    SuccinctRDDImpl → SuccinctRDD
  89. def persist(newLevel: StorageLevel): SuccinctRDDImpl.this.type

    Persists the Succinct partitions at the specified storage level, ignoring any existing target storage level.

    Definition Classes
    SuccinctRDDImpl → RDD
  90. def persist(): SuccinctRDDImpl.this.type

    Definition Classes
    RDD
  91. def pipe(command: Seq[String], env: Map[String, String], printPipeContext: ((String) ⇒ Unit) ⇒ Unit, printRDDElement: (Array[Byte], (String) ⇒ Unit) ⇒ Unit, separateWorkingDir: Boolean): RDD[String]

    Definition Classes
    RDD
  92. def pipe(command: String, env: Map[String, String]): RDD[String]

    Definition Classes
    RDD
  93. def pipe(command: String): RDD[String]

    Definition Classes
    RDD
  94. final def preferredLocations(split: Partition): Seq[String]

    Definition Classes
    RDD
  95. def randomSplit(weights: Array[Double], seed: Long): Array[RDD[Array[Byte]]]

    Definition Classes
    RDD
  96. def reduce(f: (Array[Byte], Array[Byte]) ⇒ Array[Byte]): Array[Byte]

    Definition Classes
    RDD
  97. def regexSearch(query: String): RDD[Array[Byte]]

    Searches for all records that contain a match for a regular expression query and returns those records.

    query

    The regular expression search query.

    returns

    The RDD containing all records that match the regular expression search.

    Definition Classes
    SuccinctRDD
  98. def regexSearchOffsets(query: String): RDD[RegExMatch]

    Searches for the input regular expression within the RDD and returns results as (offset, length) pairs. The query must be UTF-8 encoded.

    query

    The regular expression search query.

    returns

    RDD of matched pattern occurrences.

    Definition Classes
    SuccinctRDD
  99. def repartition(numPartitions: Int)(implicit ord: Ordering[Array[Byte]]): RDD[Array[Byte]]

    Definition Classes
    RDD
  100. def sample(withReplacement: Boolean, fraction: Double, seed: Long): RDD[Array[Byte]]

    Definition Classes
    RDD
  101. def save(location: String): Unit

    Saves the SuccinctRDD at the specified path.

    location

    The path where the SuccinctRDD should be stored.

    Definition Classes
    SuccinctRDD
  102. def saveAsObjectFile(path: String): Unit

    Definition Classes
    RDD
  103. def saveAsTextFile(path: String, codec: Class[_ <: CompressionCodec]): Unit

    Definition Classes
    RDD
  104. def saveAsTextFile(path: String): Unit

    Definition Classes
    RDD
  105. def search(query: String): SearchRecordIdResultsRDD

    Searches for all records that match a query and returns results as recordIds relative to each partition.

    query

    The search query.

    returns

    The SearchRecordIdResultsRDD corresponding to the search query.

    Definition Classes
    SuccinctRDD
  106. def search(query: Array[Byte]): SearchRecordIdResultsRDD

    Searches for all records that match a query and returns results as recordIds relative to each partition.

    query

    The search query.

    returns

    The SearchRecordIdResultsRDD corresponding to the search query.

    Definition Classes
    SuccinctRDD
  107. def searchOffsets(query: String): RDD[Long]

    Searches for all occurrences of a query string within the RDD.

    query

    The search query.

    returns

    The RDD of recordIds.

    Definition Classes
    SuccinctRDD
  108. def searchOffsets(query: Array[Byte]): RDD[Long]

    Searches for all occurrences of a query within the RDD.

    query

    The search query.

    returns

    The RDD of iterables over recordIds into each partition.

    Definition Classes
    SuccinctRDD
  109. def setName(_name: String): SuccinctRDDImpl.this.type

    Sets the name for the RDD; defaults to "SuccinctRDD".

    Definition Classes
    SuccinctRDDImpl → RDD
  110. def sortBy[K](f: (Array[Byte]) ⇒ K, ascending: Boolean, numPartitions: Int)(implicit ord: Ordering[K], ctag: ClassTag[K]): RDD[Array[Byte]]

    Definition Classes
    RDD
  111. def sparkContext: SparkContext

    Definition Classes
    RDD
  112. def subtract(other: RDD[Array[Byte]], p: Partitioner)(implicit ord: Ordering[Array[Byte]]): RDD[Array[Byte]]

    Definition Classes
    RDD
  113. def subtract(other: RDD[Array[Byte]], numPartitions: Int): RDD[Array[Byte]]

    Definition Classes
    RDD
  114. def subtract(other: RDD[Array[Byte]]): RDD[Array[Byte]]

    Definition Classes
    RDD
  115. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  116. def take(num: Int): Array[Array[Byte]]

    Definition Classes
    RDD
  117. def takeOrdered(num: Int)(implicit ord: Ordering[Array[Byte]]): Array[Array[Byte]]

    Definition Classes
    RDD
  118. def takeSample(withReplacement: Boolean, num: Int, seed: Long): Array[Array[Byte]]

    Definition Classes
    RDD
  119. val targetStorageLevel: StorageLevel

    The storage level for the RDD.

  120. def toDebugString: String

    Definition Classes
    RDD
  121. def toJavaRDD(): JavaRDD[Array[Byte]]

    Definition Classes
    RDD
  122. def toLocalIterator: Iterator[Array[Byte]]

    Definition Classes
    RDD
  123. def toString(): String

    Definition Classes
    RDD → AnyRef → Any
  124. def top(num: Int)(implicit ord: Ordering[Array[Byte]]): Array[Array[Byte]]

    Definition Classes
    RDD
  125. def treeAggregate[U](zeroValue: U)(seqOp: (U, Array[Byte]) ⇒ U, combOp: (U, U) ⇒ U, depth: Int)(implicit arg0: ClassTag[U]): U

    Definition Classes
    RDD
  126. def treeReduce(f: (Array[Byte], Array[Byte]) ⇒ Array[Byte], depth: Int): Array[Byte]

    Definition Classes
    RDD
  127. def union(other: RDD[Array[Byte]]): RDD[Array[Byte]]

    Definition Classes
    RDD
  128. def unpersist(blocking: Boolean = true): SuccinctRDDImpl.this.type

    Un-persists the Succinct partitions using the specified blocking mode.

    Definition Classes
    SuccinctRDDImpl → RDD
  129. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  130. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  131. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  132. def zip[U](other: RDD[U])(implicit arg0: ClassTag[U]): RDD[(Array[Byte], U)]

    Definition Classes
    RDD
  133. def zipPartitions[B, C, D, V](rdd2: RDD[B], rdd3: RDD[C], rdd4: RDD[D])(f: (Iterator[Array[Byte]], Iterator[B], Iterator[C], Iterator[D]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[D], arg3: ClassTag[V]): RDD[V]

    Definition Classes
    RDD
  134. def zipPartitions[B, C, D, V](rdd2: RDD[B], rdd3: RDD[C], rdd4: RDD[D], preservesPartitioning: Boolean)(f: (Iterator[Array[Byte]], Iterator[B], Iterator[C], Iterator[D]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[D], arg3: ClassTag[V]): RDD[V]

    Definition Classes
    RDD
  135. def zipPartitions[B, C, V](rdd2: RDD[B], rdd3: RDD[C])(f: (Iterator[Array[Byte]], Iterator[B], Iterator[C]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[V]): RDD[V]

    Definition Classes
    RDD
  136. def zipPartitions[B, C, V](rdd2: RDD[B], rdd3: RDD[C], preservesPartitioning: Boolean)(f: (Iterator[Array[Byte]], Iterator[B], Iterator[C]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[V]): RDD[V]

    Definition Classes
    RDD
  137. def zipPartitions[B, V](rdd2: RDD[B])(f: (Iterator[Array[Byte]], Iterator[B]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[V]): RDD[V]

    Definition Classes
    RDD
  138. def zipPartitions[B, V](rdd2: RDD[B], preservesPartitioning: Boolean)(f: (Iterator[Array[Byte]], Iterator[B]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[V]): RDD[V]

    Definition Classes
    RDD
  139. def zipWithIndex(): RDD[(Array[Byte], Long)]

    Definition Classes
    RDD
  140. def zipWithUniqueId(): RDD[(Array[Byte], Long)]

    Definition Classes
    RDD
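
The offset-based members listed above (searchOffsets, countOffsets, extract) are described in terms of byte offsets into the original data. The sketch below is a plain-Scala model of those semantics over a single in-memory byte array. It is not the Succinct implementation and does not use Spark; the object name OffsetQueryModel and the extra `data` parameter are hypothetical, introduced only for illustration, and the assumption that offsets index one flat byte sequence is ours.

```scala
import java.nio.charset.StandardCharsets.UTF_8
import java.util.Arrays

// Hypothetical model of the offset queries; NOT the Succinct implementation.
// Assumption: the RDD contents behave like one flat byte array `data`.
object OffsetQueryModel {

  // Start offsets of every occurrence of `query` in `data` (overlaps included).
  def searchOffsets(data: Array[Byte], query: Array[Byte]): Seq[Long] =
    if (query.isEmpty) Seq.empty[Long]
    else
      (0 to data.length - query.length)
        .filter(i => Arrays.equals(data.slice(i, i + query.length), query))
        .map(_.toLong)

  // Number of occurrences of `query` in `data`.
  def countOffsets(data: Array[Byte], query: Array[Byte]): Long =
    searchOffsets(data, query).size.toLong

  // `length` bytes starting at `offset`; slice clamps to the array bounds.
  def extract(data: Array[Byte], offset: Long, length: Int): Array[Byte] =
    data.slice(offset.toInt, offset.toInt + length)

  def main(args: Array[String]): Unit = {
    val data  = "abracadabra".getBytes(UTF_8)
    val query = "abra".getBytes(UTF_8)

    println(searchOffsets(data, query).mkString(", ")) // prints "0, 7"
    println(countOffsets(data, query))                 // prints "2"
    println(new String(extract(data, 7L, 4), UTF_8))   // prints "abra"
  }
}
```

In this model, an offset returned by searchOffsets can be passed straight to extract to recover the matched bytes. Whether the real class reports offsets in exactly this way is not stated on this page.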

Deprecated Value Members

  1. def filterWith[A](constructA: (Int) ⇒ A)(p: (Array[Byte], A) ⇒ Boolean): RDD[Array[Byte]]

    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use mapPartitionsWithIndex and filter

  2. def flatMapWith[A, U](constructA: (Int) ⇒ A, preservesPartitioning: Boolean)(f: (Array[Byte], A) ⇒ Seq[U])(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use mapPartitionsWithIndex and flatMap

  3. def foreachWith[A](constructA: (Int) ⇒ A)(f: (Array[Byte], A) ⇒ Unit): Unit

    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use mapPartitionsWithIndex and foreach

  4. def mapPartitionsWithContext[U](f: (TaskContext, Iterator[Array[Byte]]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
    Annotations
    @DeveloperApi() @deprecated
    Deprecated

    (Since version 1.2.0) use TaskContext.get

  5. def mapPartitionsWithSplit[U](f: (Int, Iterator[Array[Byte]]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 0.7.0) use mapPartitionsWithIndex

  6. def mapWith[A, U](constructA: (Int) ⇒ A, preservesPartitioning: Boolean)(f: (Array[Byte], A) ⇒ U)(implicit arg0: ClassTag[U]): RDD[U]

    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use mapPartitionsWithIndex

  7. def toArray(): Array[Array[Byte]]

    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use collect
