NEWS
tidytable 0.11.2
Bug fixes
- Empty
dt()
works with no warning, #824
tidytable 0.11.1 (2024-07-18)
Functionality improvements
pmap()
now preserves names (#809)
- Package developers no longer need to define
.datatable.aware = TRUE
when
using tidytable
in their package (#269)
Bug fixes
- Attempting to rename columns using
group_by()
now leads to an error (#799)
pmap()
family works with data frame inputs (#803)
filter()
properly handles when comparing to NA
when .by
is used (#812)
paged.print
has been removed since it was breaking console printing (#810)
tidytable 0.11.0 (2024-02-09)
Functionality improvements
distinct()
now works on data frames with list columns (#773)
pivot_wider()
: Gains unused_fn
argument (#698)
Deprecations
verb.()
functions have been removed
Bug fixes
count()
has a more helpful error message when pick()
is used (#778)
unnest(keep_empty = TRUE)
preserves vectors of length 0 (#783)
Functions with notable speed improvements
tidytable 0.10.2 (2023-10-04)
- Patch release to pass CRAN checks for future R release
tidytable 0.10.1 (2023-04-20)
New functions
Bug fixes
case_when()
: .default
is used when calculating a common ptype (#751)
pivot_wider()
: Works correctly with dates (#759)
Functions with notable speed improvements
case_match()
: When comparing input vector to a length 1 literal
tidytable 0.10.0 (2023-03-09)
Deprecations
verb.()
functions are now deprecated (e.g. mutate.()
). Users should now use
the verb()
versions of functions.
New functions
Functionality improvements
nest()
: Gains .by
and .key
args
tidytable()
: Auto-names unnamed inputs
slice_*()
family: by
arg added to match dplyr
semantics.
.by
can still be called by the user instead of by
.
dt()
: Can use let()
to add columns even on older versions of data.table
Bug fixes
expand_grid()
: Can use "unique" or "sorted" as column names (#718)
Breaking changes
slice_head()
/_tail()
: .by
arg must be explicitly named when slicing by group
Functions with notable speed improvements
mutate()
: When overwriting existing columns on data frames with a high
number of columns (>1000)
tidytable 0.9.2 (2023-01-13)
New functions
cross_join()
group_cols()
map_vec()
pick()
separate_longer_delim()
separate_wider_delim()
separate_wider_regex()
Functionality improvements
separate()
: Can now handle when too many or too few new names are
specified in into
arg (#666)
unnest_longer()
: Gains keep_empty
arg
Bug fixes
separate()
: Can overwrite separated column without removal (#680)
na_if()
: Properly replaces NA
s when y
is a vector (#689)
across()
: Anonymous functions are properly translated (#699)
pivot_wider()
: names_sort = FALSE
works (#705)
Deprecations
- Using
by = character()
in joins is now deprecated. Users should instead use cross_join()
.
tidytable 0.9.1 (2022-10-25)
New functions
nest_join()
- Ranking functions:
min_rank()
dense_rank()
percent_rank()
cume_dist()
Functionality improvements
%in%
falls back to base::'%in%'
when input types aren't compatible
with vec_in()
(@krterberg, #632)
relocate()
: Can rename columns that are moved
- Joins: Can now do cross joins by specifying
by = character()
group_by()
: Gains .add
argument
ungroup()
: Gains ...
arguments
- Printing of grouped tidytables now shows grouping variables
Bug fixes
row_number()
: Works correctly on 0-row data frame when overwriting existing column (#639)
slice_head()
/slice_tail()
: Properly slice on 0-row data frame (#642)
Functions with notable speed improvements
tidytable 0.9.0 (2022-09-23)
Dotless functions!
tidytable
now exports dotless versions of all functions (e.g. arrange()
/mutate()
/etc.).
verb.()
syntax is still available to users for backwards compatibility.
- Users can use both
tidytable
and dplyr
by simply loading
dplyr
after tidytable
, as the verb.()
functions won't be overwritten by dplyr
.
New functions
dplyr
-style interface to grouping
group_by()
/ungroup()
group_vars()
is_grouped_df()
rowwise()
add_tally()
/tally()
case_match()
Functionality improvements
summarize()
: Gains .unpack
argument
pivot_longer()
: Can pass a single ptype or function to
values_ptypes
/values_transform
/names_ptypes
/names_transform
args.
unnest_longer()
/unnest_wider()
: Can pass a single ptype or function to
ptype
/transform
args.
Bug fixes
tidytable::'%in%'
dispatches to base::'%in%'
when comparing with a list (#563)
pivot_wider()
: Works with column names with spaces (#569)
pivot_wider()
: names_glue="{.value}_{somecolumn}"
assigns column names in
correct order (@Darxor, #579)
left_join()
: Works when y matching columns in by
is a non matching column of x (#625)
tidytable 0.8.1 (2022-08-17)
New Functions
consecutive_id.()
if_else.()
- Note: Alternate syntax
ifelse.()
(introduced in v0.4.0) will remain in the package
%in%
Functionality improvements
arrange.()
: Can use .env
inside arrange expressions
case_when.()
: Gains .default
/.ptype
/.size
args
coalesce.()
: Gains .ptype
/.size
args
relocate.()
: Now properly handles multiple columns selected in .before
or .after
slice_min.()
/slice_max.()
: Gain with_ties
argument
Bug Fixes
- Nested calls to
across.()
are handled properly (#505)
across.()
: Can namespace functions in .fns
arg (#511)
as_tidytable()
: Can keep row names when converting a matrix (#527)
unnest.()
: Handles empty data frames (@roboton, #530)
nth.()
: Extracts list elements (#534)
arrange.()
: Properly sorts NA
s (#541)
Deprecations
arrange_across.
/mutate_across.
/summarize_across.
are now defunct. They have been
deprecated with warnings since v0.6.4 (Jul 2021). Users must now use across.()
inside
arrange.()
/mutate.()
/summarize.()
.
tidytable 0.8.0 (2022-06-11)
New functions
Functionality improvements
expand_grid.()
: Works with data frame inputs
first.()
/last.()
/nth.()
: Gain na_rm
arg
mutate_rowwise.()
: Gains .keep
, .before
, and .after
args
tidytable()
: Auto-unpacks unnamed data frame inputs
Breaking changes
count.()
: Default name is now n
instead of N
to match dplyr semantics
Bug fixes
bind_cols.()
: Correctly handles lists (#446)
Functions with notable speed improvements
arrange.()
case_when.()
: Faster when conditions evaluate to NA
group_split.()
left_join.()
: Faster when keep = FALSE
(the default)
select.()
uncount.()
Other notes
tidytable
no longer directly depends on lifecycle
tidytable 0.7.2 (2022-04-28)
New functions
Functionality improvements
summarize.()
: Now sorts by the grouping variables when .by
is used.
dt()
: Experimental support for tidy evaluation
as_tidytable()
: Now defaults to .name_repair = "unique"
to relax restrictions on creating new tidytables
Functions with notable speed improvements
dt()
: Faster when adding new columns or updating existing columns
as_tidytable()
: Faster when converting data.frame
or list
objects to a tidytable
get_dummies.()
Bug fixes
across.()
: .cols
arg can find environment variables in custom functions (#389)
- Joins: Duplicate columns are treated properly when
keep = FALSE
(#397)
- Can use anonymous functions inside
map
functions inside mutate.()
(#402)
tidytable 0.7.1 (2022-03-22)
New functions
add_count.()
cur_column.()
cur_data.()
Bug fixes
unite.()
: New column is always placed before the first united column in the data frame,
regardless of order provided.
- Can use
.data
pronoun inside desc()
in arrange.()
(#371)
tidytable 0.7.0 (2022-02-16)
Functionality improvements
left
/right
/inner
/full
joins gain suffix
and keep
args (#354)
rename.()
: Can now rename columns by position (#361)
unite.()
: The new column is placed before united columns to match tidyr behavior
New functions
tidytable 0.6.7 (2022-01-14)
Functionality improvements
.data
and .env
pronouns now work in tidytable
functions
across.()
: Works inside of a named mutate.()
. Useful with rowSums
/rowMeans
(#346)
first.()
/last.()
: Gain a default
arg
mutate.()
: Can use stringr::str_glue()
without specifying .envir
replace_na.()
: Checks that replace
arg only uses columns that exist in the data frame
New functions
Bug fixes
mutate.()
: Can assign to the same column when .by = character(0)
(#332)
tidytable 0.6.6 (2021-12-10)
Functionality improvements
between()
is now auto-translated to between.()
when used inside tidytable
functions
%notin%
now uses %chin%
on character vectors
Bug fixes
across.()
: Can pass extra arguments to a list of functions (#319)
tidytable 0.6.5 (2021-09-03)
New functions
Bug fixes
- Traceback error messages no longer unnecessarily print the full data frame (#305)
tidytable 0.6.4 (2021-07-31)
New functions
unnest_longer.()
unnest_wider.()
Functionality improvements
bind_rows.()
/bind_cols.()
- Can splice lists of data frames using
!!!
mutate.()
: Can use glue::glue()
without specifying .envir
pull.()
: Added a name
argument
separate.()
: Can omit output columns by using NA
in into
tidytable()
: Can splice quosures
unnest.()
: Added keep_empty
arg
Deprecations
- The following functions have all been deprecated due to the addition of
across.()
in v0.6.1
arrange_across.()
mutate_across.()
summarize_across.()
tidytable 0.6.3 (2021-06-22)
New functions
enframe.()
first.()
last.()
Functionality improvements
- Subclasses and attributes are now preserved when using
bind_cols
,
bind_rows
, group_split
, or joins.
complete.()
: Added a .by
argument
expand.()
: Added a .by
argument
mutate.()
: Auto-names unnamed inputs
transmute.()
: Auto-names unnamed inputs
Bug fixes
expand.()
with nesting.()
- Additional double vectors can have negative values (#282)
tidytable 0.6.2 (2021-05-18)
New functions
cur_group_id.()
/cur_group_rows.()
n_distinct.()
nesting.()
Functionality improvements
mutate.()
:
- Added
.keep
argument
- Added
.before
/.after
arguments
tidytable()
: Added .name_repair
argument
Functions with notable speed improvements
filter.()
when .by
is used
tidytable 0.6.1 (2021-04-16)
New functions
Functionality improvements
group_split.()
: Added a .named
argument. If .named = TRUE
, the resulting list will
have names that indicate the group they are a part of.
pivot_longer.()
: Can now use names_to = c(".value", NA)
to drop the id column.
Bug fixes
if_all.()
/if_any.()
- No longer try to use
.by
columns (#225)
- Can use
n()
in filtering (#226)
pivot_longer.()
- No longer converts factor values to character (#202, #234)
- Preserves column order when using
names_to = c(".value", "id")
(#235)
tidytable 0.6.0 (2021-04-02)
New functions
Functionality improvements
replace_na.()
: Can now replace NULL
values in list-columns
- Splicing lists using
!!!
now works in crossing.()
and expand_grid.()
- The following tidyverse functions are now automatically converted to their tidytable
verb.()
equivalents when
used inside of tidytable's mutate/arrange/filter/summarize/slice.
- Functions that previously failed but now work:
- Functions that previously worked with tidytable but are now converted to their faster tidytable equivalents:
if_else()
/ifelse()
case_when()
desc()
replace_na()
Bug fixes
pivot_longer.()
: Correctly pivots unbalanced datasets when ".value"
is used
in names_to
(@moutikabdessabour, #212)
Functions with notable speed improvements
mutate_across.()
mutate_rowwise.()
when c_across.()
is used
slice.()
/slice_head.()
/slice_tail.()
/slice_min.()
/slice_max.()
Other news
tidytable
has dropped the tibble
package as a dependency and now uses pillar
instead.
tibble
was originally used for its trunc_mat()
function that enabled tidytables to print like
tibbles do in console. pillar
is the same package that tibble
uses in the background.
tidytable 0.5.9 (2021-03-04)
New functions
Functionality improvements
as_tidytable()
: Gains .name_repair
and .keep_rownames
args
count.()
: Added wt
, sort
and name
args. (@moutikabdessabour, #196)
mutate.()
: Variables are now updated in order when using ".by" (#166)
pivot_longer.()
- Added
names_prefix
arg
- Can now use
".value"
in names_to
pivot_wider.()
: Can now use ".value"
in names_glue
separate.()
: Added convert
arg
summarize.()
: Variables are now created in order (#188)
tidytable 0.5.8 (2021-01-28)
Breaking changes
pivot_longer.()
: values_drop_na
is no longer the 5th argument in the function
pivot_wider.()
: values_fn
& values_fill
args are no longer the 6th & 7th args
New functions
c_across.()
extract.()
mutate_rowwise.()
nest.()
slice_sample.()
Functionality improvements
pivot_longer.()
- New args:
names_sep
, names_pattern
, names_ptypes
, names_transform
, names_repair
,
values_ptypes
, values_transform
, fast_pivot
fast_pivot
arg: Faster pivoting. The "names" column will be a factor instead of character.
Default is set to FALSE
to match tidyverse semantics. Note: This option sets variable.factor = TRUE
in data.table::melt()
, which is what leads to it being faster.
pivot_wider.()
- New args:
names_prefix
, names_glue
, names_sort
, names_repair
tidytable 0.5.7 (2020-12-16)
Deprecations
- The
dt_verb()
versions of functions have been removed from the package.
They have been deprecated with warnings since v0.5.2
Functionality improvements
crossing.()
: Now works with data frame inputs (@moutikabdessabour, #143)
distinct.()
: Can rename columns (#153)
get_dummies.()
: Dummy columns are now sorted in alphabetical order
pivot_wider.()
: Added values_fill
argument
Functions with notable speed improvements
get_dummies.()
fill.()
slice.()
Bug fixes
mutate.()
: Can delete a column using NULL
when .by
is provided (#151)
mutate_across.()
: Can reference other columns in .fns call (#145)
slice.()
: Works when .by
contains all columns (#158)
unnest.()
: Works when the only column is the list column (#144)
tidytable 0.5.6 (2020-09-30)
Breaking changes
bind_cols.()
: Name repair uses vec_as_names()
instead of vec_as_names_legacy()
Functionality improvements
bind_cols.()
: Added .name_repair
arg
unnest.()
: Added names_sep
and names_repair
args
print()
- Added
n
, width
, and n_extra
args
- Now prints like tibbles in all cases (special thanks to @moutikabdessabour)
slice.()
: Can now drop specified rows with negative numbers
New functions
arrange_across.()
case_when.()
- The old "case when" translation
case.()
will remain in the package,
as it is called like data.table::fcase()
but allows for the default
to be a vector.
desc.()
Functions with notable speed improvements
Deprecations
unnest.()
: .keep_all
arg changed to .drop
tidytable 0.5.5 (2020-09-02)
Breaking changes
bind_rows.()
: Removed .use_names
and .fill
args
- These are now assumed to be
TRUE
, which matches dplyr
semantics.
complete.()
: .fill
arg renamed to fill
pull.()
: defaults to var = -1
instead of var = NULL
- The result is the same when using the default value,
but will break cases where
var = NULL
was explicitly called.
slice.()
: .by
must be named when slicing by group
Functionality improvements
tidytable
now integrates better with tidyverse functions through vctrs
. This means code such as
vec_cbind(tibble::tibble(x = 1:3), tidytable(y = "foo"))
will bind the results into a tidytable.
complete.()
: Can now append extra values
pull.()
: Numerical selection now works
slice.()
: Now uses ...
to select which rows to slice
New functions
Bug fixes
rename.()
: Works when there are spaces in the column names (#109)
Functions with notable speed improvements
tidytable 0.5.4 (2020-08-10)
Breaking changes
- Column names auto-generated by
summarize_across.()
now use a suffix
instead of a prefix
Functionality improvements
mutate_across.()
: Added .names
arg to help with naming newly created columns
summarize_across.()
: Added .names
arg to help with naming newly created columns
summarize.()
: Added .sort
arg to optionally sort the resulting data.table
Bug fixes
transmute.()
: Doesn't drop "by" columns (#98)
slice.()
- Allows gaps in
rows
arg (#99)
- Doesn't reorder columns when using
.by
(#101)
full_join.()
: No longer returns a keyed data.table (#102)
Internal
- Import
tibble
- Enables cleaner console printing of tidytables
glimpse()
is now reexported
- Will be replaced by
pillar
in a later release once tibble::trunc_mat()
&
tibble::glimpse()
are moved there
tidytable 0.5.3 (2020-07-16)
Breaking changes
bind_rows.()
use.names
arg renamed to .use_names
fill
arg renamed to .fill
New functions
complete.()
crossing.()
expand.()
expand_grid.()
uncount.()
Functionality improvements
replace_na.()
: Now works on data.frame
/data.table
inputs as well as vectors
Deprecations
desc.()
has been deprecated due to incompatibility with character columns
tidytable 0.5.2 (2020-06-26)
- This version contains general performance improvements
Deprecations
- Group by arg
by
has been deprecated for .by
.
This will allow "by" to be used as a column name in mutate.()
& summarize.()
in future releases.
- All
dt_verb()
functions are soft deprecated
New functions
Functionality improvements
unnest.()
: Added .keep_all
arg to keep all list columns that were not unnested
tidytable 0.5.1 (2020-05-29)
New functions
summarize_across.()
& summarise_across.()
Bug fixes:
- Fixed
distinct.()
bug where it wasn't returning unique rows
tidytable 0.5.0 (2020-05-23)
tidytable
now works with quosures
tidyselect v0.1.1
compatibility: Updated functions to use where()
Functionality improvements
nest_by.()
: Added .keep
arg
Deprecations
- Old select helpers like
everything.()
are now replaced by normal helpers like everything()
rename_across/_if/_at/_all
have been superseded by rename_with.()
Breaking changes
group_split.()
: keep
arg renamed to .keep
New functions
Functions with notable speed improvements
tidytable 0.4.1 (2020-04-30)
tidytable
now utilizes tidyselect
& vctrs
Breaking changes
get_dummies.()
: cols
arg defaults to c(is.character, is.factor)
.
The result is the same, but is more consistent with other enhanced selection functions
mutate_across.()
: .funs
argument renamed to .fns
to match dplyr 1.0
New functions
Functionality improvements
- Paged printing now works in Rmarkdown
bind_rows.()
: Does name checking and fills missing
distinct.()
: Added .keep_all
argument
fill.()
: Preserves column order when using "by"
group_split.()
: Added keep
arg
nest_by.()
- Can now unnest multiple columns in one call
- data.tables in a list column can now have different ncols
and different column order
select.()
: Now allows column renaming
transmute.()
: Summary functions (mean
/max
/etc.) can now be used
Functions with notable speed improvements
distinct.()
drop_na.()
relocate.()
Bug fixes
group_split.()
: Now returns a list of tidytables instead of data.tables
left_join.()
: Now works with "by" columns of different names
tidytable 0.4.0 (2020-03-28)
New syntax
- All functions are now written as
verb.()
dt_verb()
functions still work, but may be slowly deprecated in future releases
Functionality improvements
- Enhanced selection now works in "by" calls
filter.()
now works with "by"
pivot_wider.()
can now be used with an aggregation function
pull.()
defaults to last column in a data.table
New functions
ifelse.()
n.()
nest_by.()
replaces dt_group_nest()
row_number.()
transmute.()
unnest.()
replaces dt_unnest_legacy()
- Select helper
any_of.()
Functions with notable speed improvements
fill.()
pivot_longer.()
unnest.()
tidytable 0.3.2 (2020-03-08)
tidytable class
- New
tidytable
class is a data.table subclass with cleaner printing.
- Operates like a data.table in all other cases.
- All tidytable functions automatically convert to
tidytable
class in the background.
New functions
- tidytable constructors:
tidytable()
: Create a data.table/tidytable
as_tidytable()
: Convert to data.table/tidytable
is_tidytable()
: Test if an object is a data.table/tidytable
dt_get_dummies()
: Get dummy variables
dt_separate()
dt_pivot_longer()
"cols" arg now defaults to dt_everything()
instead of NULL
.
The result is the same, but is more consistent with other enhanced selection functions
.N
now works when using dt_slice()
& variants
- Function length limit bug no longer occurs when using
~
in dt_map()
tidytable 0.3.1 (2020-02-19)
- Updated
dt()
docs per CRAN instructions
Breaking changes
dt_mutate()
& dt_rename()
no longer modify-by-reference
dt_unnest_legacy()
drops "keep" argument, but automatically keeps all non-nested columns.
- New translation is ~5-10x faster than before
Functionality improvements
dt()
now automatically converts data.frame input to a data.table
dt_mutate_across()
: Added "by" arg
Bug fixes
dt_left_join()
fix so columns are ordered correctly
tidytable 0.3.0 (2020-02-10)