dendropy.calculate.popgenstat
: Population Genetics Statistics¶
Population genetic statistics.
-
dendropy.calculate.popgenstat.
average_number_of_pairwise_differences
(char_matrix, ignore_uncertain=True)[source]¶ Returns $k$, calculated for a character block.
-
dendropy.calculate.popgenstat.
derived_state_matrix
(char_matrix, ancestral_sequence=None, derived_state_alphabet=None, ignore_uncertain=True)[source]¶ Given a list of CharDataSequence objects, and a reference ancestral sequence, this returns a list of strings corresponding to the list of CharDataSequence objects, where a ‘0’ indicates the ancestral state and ‘1’ a derived state.
e.g.
- Given:
GGCTAATCTGA GCTTTTTCTGA GCTCTCTCTTC
- with ancestral sequence:
GGTTAATCTGA
- this returns:
0010000000 0000110000 0001110011
-
dendropy.calculate.popgenstat.
nucleotide_diversity
(char_matrix, ignore_uncertain=True)[source]¶ Returns $pi$, calculated for a character block.
-
dendropy.calculate.popgenstat.
num_segregating_sites
(char_matrix, ignore_uncertain=True)[source]¶ Returns the raw number of segregating sites (polymorphic sites).
-
dendropy.calculate.popgenstat.
tajimas_d
(char_matrix, ignore_uncertain=True)[source]¶ Returns Tajima’s D.
-
dendropy.calculate.popgenstat.
unfolded_site_frequency_spectrum
(char_matrix, ancestral_sequence=None, ignore_uncertain=False, pad=True)[source]¶ Returns the site frequency spectrum of list of CharDataSequence objects given by char_sequences, with reference to the ancestral sequence given by ancestral_seq. If ancestral_seq is None, then the first sequence in char_sequences is taken to be the ancestral sequence.