R: summary

describe {SparkR}

R Documentation

summary

Description

Computes statistics for numeric and string columns. If no columns are given, this function computes statistics for all numerical or string columns.

Usage

describe(x, col, ...)

summary(object, ...)

## S4 method for signature 'SparkDataFrame,character'
describe(x, col, ...)

## S4 method for signature 'SparkDataFrame,ANY'
describe(x)

## S4 method for signature 'SparkDataFrame'
summary(object, ...)

Arguments

`x`	a SparkDataFrame to be computed.
`col`	a string of name.
`...`	additional expressions.
`object`	a SparkDataFrame to be summarized.

Value

A SparkDataFrame.

Note

describe(SparkDataFrame, character) since 1.4.0

describe(SparkDataFrame) since 1.4.0

summary(SparkDataFrame) since 1.5.0

Other SparkDataFrame functions: SparkDataFrame-class, agg, arrange, as.data.frame, attach, cache, coalesce, collect, colnames, coltypes, createOrReplaceTempView, crossJoin, dapplyCollect, dapply, dim, distinct, dropDuplicates, dropna, drop, dtypes, except, explain, filter, first, gapplyCollect, gapply, getNumPartitions, group_by, head, histogram, insertInto, intersect, isLocal, join, limit, merge, mutate, ncol, nrow, persist, printSchema, randomSplit, rbind, registerTempTable, rename, repartition, sample, saveAsTable, schema, selectExpr, select, showDF, show, storageLevel, str, subset, take, union, unpersist, withColumn, with, write.df, write.jdbc, write.json, write.orc, write.parquet, write.text

Examples

## Not run: 
##D sparkR.session()
##D path <- "path/to/file.json"
##D df <- read.json(path)
##D describe(df)
##D describe(df, "col1")
##D describe(df, "col1", "col2")
## End(Not run)

[Package SparkR version 2.1.2 Index]