the output ought to be in the same format than auto_find_circ, so some changes:
- in the output matrices the circexploere score is shown twice, and means something different than in find_circ_auto
- circexplorer would be able to attach genes to coordinates in the outputfile, but is now added later with the first matrix that is created
- manually circexplorer1_starter_1.pl with inputfile1 inputfile2 samplename ; then circexplorer1_out_reader.pl with its outfile, then optional matrixmaker/matrixtwo.pl
- auto_automaker:
~ head infiles_for_auto_automaker.txt
lineonefile1 linetwofile1 samplename1 group1
lineonefile2 linetwofile2 samplename2 group1
the group will lead to auto_automaker making a directory named after the group where all the resulting .csv files will be copied into, catted into one big .csv file and then run matrixmaker.pl with this as an input and then start matrixtwo.pl with this as an input
- let find_circ/godfather.pl handle it all, see find_circ/godfather.pl
- paired end RNA-seq files (sample1_line1.fastq and sample1_line2.fastq)
- get those file for RNAse treated cells from the same sample aswell
- install STAR aligner
- build STAR index with hg19.fa and ensembl annotations
- get filenames into the same dir, get a samplefile with files, samplename and group like find_circ_auto needs
- install circexplorer1
the scripts that handle the commands until the outputfiles are readable for the matrixmaker are of course different, the upper automation layers are close to the find_circ_auto repo with minor differences:
- the group/processsed.tsv instead of .csv: the files ate tab separated anyway, so using the corresponding .tsv here