Load data into a mavproject
prior to analysis. Data must
be formatted as a dataframe with samples in rows and loci in columns. If
individuals are polyploid then multiple rows can be used per sample. Ploidy
is allowed to vary between samples, and can be specified in multiple ways.
bind_data( project, df, ID_col = 1, pop_col = NULL, ploidy_col = NULL, data_cols = NULL, ID = NULL, pop = NULL, ploidy = NULL, missing_data = -9, wide_format = FALSE, name = NULL, check_delete_output = TRUE )
project | an rmaverick project, as produced by the function
|
---|---|
df | a dataframe containing genetic information and optional meta-data |
ID_col | which column of the input data contains the sample IDs. If NULL
then IDs must be defined seperately through the |
pop_col | which column of the input data contains the ostensible
population of the samples. If NULL then populations must be defined
seperately through the |
ploidy_col | which column of the input data contains the ploidy of the
samples. If NULL then ploidy must be defined seperately through the
|
data_cols | which columns of the input data contain genetic information. Defaults to all remaining columns of the data once special columns have been accounted for |
ID | sample IDs, if not using the |
pop | ostensible populations, if not using the |
ploidy | sample ploidy, if not using the |
missing_data | which value represents missing data |
wide_format | if |
name | optional name of the data set to aid in record keeping |
check_delete_output | whether to prompt the user before overwriting existing data |