You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Options: -window [int] bp size of the sliding window, default 3,000,000
-step [int] bp size of the moving step, defalut 300,000
-intact indicate the -all file is an LTR_retriever .pass.list instead of a RepeatMasker .out file
The .out.LTR.distribution.txt file is generated by default.
Add a script for whole-genome forward simulation (randomly add mutations on the genome)
Applied basic hit filtering: SW_score>=300, alignment length >= 80 bp
Add more usage information to -h
Update README
Bug fixed
Program halt when nothing is masked in truncated candidates.
Program halt when multiple LTR_retriever tasks simutainously check RepeatMasker in the same directory
substr sequences out of range when self-corrected reads are used as input
LAI Version b2
Rewrite LTR_calc.pl with more accurate and efficient algorithms.
Add the -step parameter for overlapping-sliding window scheme to estimate LAI
Output the size of the genome for genome LAI
Memory consumption of this scrip is approx. 2X the size of the input genome
To control the boom and bust dynamic of LTR-RTs, adjust the raw LAI based on LTR identity.
Estimate mean identity of LTR sequences in a genome using all-versus-all blastn search
Add a quick estimation (-q) of genomic LTR identity based on a log-linear model with the slope estimated from three small subsets of LTRs
To avoid abnormal adjustment, if estimated LTR identity <= 92% or >= 96%, then corrected it to 92% or 96%, respectively
Use the -unlock parameter to release the restriction of LTR identity ([92, 96]) for good genomes with extreme LTR activities
Set LAI_adj=0 if raw LAI==0
The alignment identity cutoff (-iden) can excludes hits higher than this value for LTR identity calculation. Default: 100 (%)
Change the output naming of LAI to raw_LAI and LAI_adj to LAI for easier description.
Add polyploid support.
If the input genome is a polypoid (diploidized ancient polypoid does not count), then only a set of chromosomes (1x, a monoploid) should be used to estimate LAI, otherwise the LTR identity will be erroneously estimated to a very high value and substantially decrease the LAI.
Use the -mono parameter to provide a list of chromosome names of a monoploid, LAI will be calculated only on these sequences.
Users can run LAI multiple times with different monoploids specified to obtain the whole genome LAI estimation.
Set prerequisites of LAI estimation
set intact LTR-RT limit >= 0.01%;
set total LTR limit >= 5%
Add the -totLTR parameter for customized total LTR content;
Add the -window parameter to control window size
Add the rush mode (-qq) to quickly estimate raw LAI for version comparison. Raw LAI should not be used to compare between different species because the LTR dynamic is not controlled.
Add status output of the LAI program. LAI is a default output of LTR_retriever. You should rerun LAI with the -mono parameter if the target genome is a polyploid.
Add Mac OsX support (High Sierra v10.13.3 tested).