Coder Social home page Coder Social logo

lyonslab / coge Goto Github PK

View Code? Open in Web Editor NEW
44.0 8.0 17.0 165.97 MB

CoGe (Comparative Genomics) Platform

Home Page: http://genomevolution.org

License: BSD 2-Clause "Simplified" License

Perl 41.39% Python 35.87% Shell 0.11% HTML 0.86% Makefile 0.06% C 12.15% Java 0.45% C++ 1.13% JavaScript 6.23% CSS 0.53% Haxe 0.91% RobotFramework 0.13% Roff 0.03% Rust 0.01% Raku 0.11% Cython 0.02%
coge comparative-genomics perl python synteny rna-seq bioinformatics genomics genomics-visualization

coge's People

Contributors

asherkhb avatar elyons avatar franka1 avatar groverj3 avatar mbomhoff avatar rchasman avatar sdavey avatar szawada avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

coge's Issues

SynMap job status is not updated to complete

When a new request is run SynMap sets its job status to running. When the analysis is completed the get_results function is called. The status continues to say running even after it has completed.

Add workflow monitoring

Need to add detailed monitoring of workflows. This should include job execution time. Start and Stop times.

Possible metrics

  • Average number of jobs
  • The number of running jobs
  • The number of completed jobs
  • Max/Current load

Add workflow cancellation

The cancel button on the job page must be able to cancel the job status. This should report to the user that job has been cancel.

Add new job status page.

The job status page needs to be updated to use the job engine.

  • Add job monitoring (report detailed information from the job engine)
  • Add workflow cancellation for users and administrators
  • Add job support for job cancellation
  • Add job restarting

Add workflow reload

The restart button on the job page should restart workflows. This should notify the user that the job has restarted.

Add chromosome sorting

The toolbar should have a feature which allows the chromosomes to be sorted by the name or size of the chromosome.

Add download dialog for analysis files

After each analysis is completed a list of available files to download is rendered to the client. The list size can be very long when running analysis tools that can run on many organisms at once.

Additionally, each file has to be downloaded manually. There is no way to download all the files at once.

A possible solution is to add a single download link which opens a download dialog. The user can then select the files they wish to download. The user then will be able to download a compress version of the files selected.

Redundant datasets being loaded

This log snippet shows multiple workers running the same task. Note that each worker is running on a separate port.

07/17/2014 06:29:50 yerba.workqueue:90[INFO] WORKQUEUE coge-main: The workflow 1868 is scheduling job /opt/apache/coge/scripts/load_genome.pl  -user_name lht -name "" -desc "only scaffolds above 20kbp" -version "1.1" -restricted 1 -source_name "University of Queensland" -organism_id "39853" -type_id "1" -staging_dir '/storage/coge/tmp/staging/lht/1868' -fasta_files 'SSPACE_trimmed_out.final.scaffolds.fasta.rename.abv20k' -irods_files '' -config coge.conf -result_dir '/storage/coge/tmp/results/lht/1868'
2014/07/17 18:29:52.04 [68341] work_queue_python: wq: coge.iplantcollaborative.org (127.0.0.1:51667) busy on '/opt/apache/coge/scripts/load_genome.pl  -user_name lht -name "" -desc "only scaffolds above 20kbp" -version "1.1" -restricted 1 -source_name "University of Queensland" -organism_id "39853" -type_id "1" -staging_dir '/storage/coge/tmp/staging/lht/1868' -fasta_files 'SSPACE_trimmed_out.final.scaffolds.fasta.rename.abv20k' -irods_files '' -config coge.conf -result_dir '/storage/coge/tmp/results/lht/1868''
2014/07/17 19:09:57.13 [68341] work_queue_python: wq: coge.iplantcollaborative.org (127.0.0.1:37615) busy on '/opt/apache/coge/scripts/load_genome.pl  -user_name lht -name "" -desc "only scaffolds above 20kbp" -version "1.1" -restricted 1 -source_name "University of Queensland" -organism_id "39853" -type_id "1" -staging_dir '/storage/coge/tmp/staging/lht/1868' -fasta_files 'SSPACE_trimmed_out.final.scaffolds.fasta.rename.abv20k' -irods_files '' -config coge.conf -result_dir '/storage/coge/tmp/results/lht/1868''
2014/07/17 19:58:37.56 [68341] work_queue_python: wq: coge.iplantcollaborative.org (127.0.0.1:34189) busy on '/opt/apache/coge/scripts/load_genome.pl  -user_name lht -name "" -desc "only scaffolds above 20kbp" -version "1.1" -restricted 1 -source_name "University of Queensland" -organism_id "39853" -type_id "1" -staging_dir '/storage/coge/tmp/staging/lht/1868' -fasta_files 'SSPACE_trimmed_out.final.scaffolds.fasta.rename.abv20k' -irods_files '' -config coge.conf -result_dir '/storage/coge/tmp/results/lht/1868''
2014/07/17 21:02:12.21 [68341] work_queue_python: wq: coge.iplantcollaborative.org (127.0.0.1:33819) busy on '/opt/apache/coge/scripts/load_genome.pl  -user_name lht -name "" -desc "only scaffolds above 20kbp" -version "1.1" -restricted 1 -source_name "University of Queensland" -organism_id "39853" -type_id "1" -staging_dir '/storage/coge/tmp/staging/lht/1868' -fasta_files 'SSPACE_trimmed_out.final.scaffolds.fasta.rename.abv20k' -irods_files '' -config coge.conf -result_dir '/storage/coge/tmp/results/lht/1868''
2014/07/17 22:10:28.04 [68341] work_queue_python: wq: coge.iplantcollaborative.org (127.0.0.1:35633) busy on '/opt/apache/coge/scripts/load_genome.pl  -user_name lht -name "" -desc "only scaffolds above 20kbp" -version "1.1" -restricted 1 -source_name "University of Queensland" -organism_id "39853" -type_id "1" -staging_dir '/storage/coge/tmp/staging/lht/1868' -fasta_files 'SSPACE_trimmed_out.final.scaffolds.fasta.rename.abv20k' -irods_files '' -config coge.conf -result_dir '/storage/coge/tmp/results/lht/1868''
This entry below show multiple workers working on the same task. Each worker connects on a seperate port.

2014/07/17 23:29:28.33 [68341] work_queue_python: wq: coge.iplantcollaborative.org (127.0.0.1:38539) busy on '/opt/apache/coge/scripts/load_genome.pl  -user_name lht -name "" -desc "only scaffolds above 20kbp" -version "1.1" -restricted 1 -source_name "University of Queensland" -organism_id "39853" -type_id "1" -staging_dir '/storage/coge/tmp/staging/lht/1868' -fasta_files 'SSPACE_trimmed_out.final.scaffolds.fasta.rename.abv20k' -irods_files '' -config coge.conf -result_dir '/storage/coge/tmp/results/lht/1868''
2014/07/18 00:30:13.76 [68341] work_queue_python: wq: coge.iplantcollaborative.org (127.0.0.1:59706) busy on '/opt/apache/coge/scripts/load_genome.pl  -user_name lht -name "" -desc "only scaffolds above 20kbp" -version "1.1" -restricted 1 -source_name "University of Queensland" -organism_id "39853" -type_id "1" -staging_dir '/storage/coge/tmp/staging/lht/1868' -fasta_files 'SSPACE_trimmed_out.final.scaffolds.fasta.rename.abv20k' -irods_files '' -config coge.conf -result_dir '/storage/coge/tmp/results/lht/1868''
2014/07/18 01:17:00.17 [68341] work_queue_python: wq: coge.iplantcollaborative.org (127.0.0.1:51691) busy on '/opt/apache/coge/scripts/load_genome.pl  -user_name lht -name "" -desc "only scaffolds above 20kbp" -version "1.1" -restricted 1 -source_name "University of Queensland" -organism_id "39853" -type_id "1" -staging_dir '/storage/coge/tmp/staging/lht/1868' -fasta_files 'SSPACE_trimmed_out.final.scaffolds.fasta.rename.abv20k' -irods_files '' -config coge.conf -result_dir '/storage/coge/tmp/results/lht/1868''
2014/07/18 01:54:22.86 [68341] work_queue_python: wq: coge.iplantcollaborative.org (127.0.0.1:51691) sending back file-8b6583d2337d354fd783542ab701e0d1-1868 to /storage/coge/tmp/staging/lht/1868
2014/07/18 01:54:22.87 [68341] work_queue_python: wq: coge.iplantcollaborative.org (127.0.0.1:51691) sending back file-5205ac7aadc00cfaa101ddf54244c354-log.done to /storage/coge/tmp/staging/lht/1868/log.done
07/18/2014 01:54:22 yerba.workflow:279[INFO] WORKFLOW Create Genome: Added info to job /opt/apache/coge/scripts/load_genome.pl  -user_name lht -name "" -desc "only scaffolds above 20kbp" -version "1.1" -restricted 1 -source_name "University of Queensland" -organism_id "39853" -type_id "1" -staging_dir '/storage/coge/tmp/staging/lht/1868' -fasta_files 'SSPACE_trimmed_out.final.scaffolds.fasta.rename.abv20k' -irods_files '' -config coge.conf -result_dir '/storage/coge/tmp/results/lht/1868'

Add job level support

A workflow should have job level details that can be queried and viewed from the job engine.

  • Add job level status
  • Add job level cancellation
  • Add job level restarting for administrators

Add option to SynMap to clear cached results/files for admins

When a synmap analysis fails, it is often due to a problem generating one of the intermediate files required for the workflow. E.g., fasta file of CDS sequences, blast databases, etc. We have been manually going to server to find and delete these files when there is a bad analysis reported. Want an admin button to do this:

rm /scratch/coge/cache/fasta/$genomeID
rm -r /scratch/coge/cache/last/db/$genomeID
or
rm -r /scratch/coge/cache/blast/db/$genomeID
rm /opt/apache2/coge/web/data/fasta/$genomeID
rm -r /opt/apache2/coge/web/data/diags/*****/$genomeID

Improve the canvas render loop.

The current render loop iterates over all points sent to the client side. The rendering is triggered whenever the canvas is zoomed or panned. Additionally, the rendering is performed in a nonblocking manner. A suggest method to improve the performance would be implementing a Quad tree.

Run dotplot not generating axis

The dotplot.pl command cannot correctly determine the bounds for axis labels. The GD::Image is undefined and cannot proceed.

[Request] Consider extracting web/js/jbrowse/plugins/CoGe into a separate repo

Hello @LyonsLab CoGe JBrowse maintainers,

Just a thought, would you consider decoupling the jbrowse/plugins/CoGe directory from the main coge repo into its own standalone repository, so that, anyone who wishes to use your CoGe/View/Track/Wiggle/XYPlot track type (or other neat extensions) can easily clone the repo into their own JBrowse forks?

Thank you for your consideration.

docker

Hi,
Do you have any plans to provide a docker container?

Thank you in advance

Michal

Add toolbar to new dotplot

The toolbar will toggle various display options, load additional data, and perform additional interactions with the plot.

spaces in iRODS folder names causing failures

If there are spaces in iRODS folder names, CoGe's workflow will fail:

Fetching /iplant/home/arabiflora/coge_data/Concatenated AFG data/A02concat_R1.fastq
Job: mkdir -p /scratch/coge/tmp/uploads/arabiflora/FCFBB44EEBD611EAA59889A743376945/irods/iplant/home/arabiflora/coge_data/Concatenated AFG data && export irodsEnvFile='/opt/apache2/coge/irodsEnv' && iget -fT '/iplant/home/arabiflora/coge_data/Concatenated AFG data/A02concat_R1.fastq' '/scratch/coge/tmp/uploads/arabiflora/FCFBB44EEBD611EAA59889A743376945/irods/iplant/home/arabiflora/coge_data/Concatenated AFG data' && touch /scratch/coge/tmp/uploads/arabiflora/FCFBB44EEBD611EAA59889A743376945/irods/iplant/home/arabiflora/coge_data/Concatenated AFG data/A02concat_R1.fastq.done
Submitted at: 31/08/20 at 03:13:44PM
Completed at: 31/08/20 at 03:13:58PM
Execution time: 12.550773 sec
Assigned to task: 14389
Return status: 0
Expected outputs: /scratch/coge/tmp/uploads/arabiflora/FCFBB44EEBD611EAA59889A743376945/irods/iplant/home/arabiflora/coge_data/Concatenated AFG data/A02concat_R1.fastq, /scratch/coge/tmp/uploads/arabiflora/FCFBB44EEBD611EAA59889A743376945/irods/iplant/home/arabiflora/coge_data/Concatenated AFG data/A02concat_R1.fastq.done
Command Output:

Improve the CoGeBlast HPS Count table

The current HSP table generates a n column table where n is the number of selected genomes. This causes the table to render off the screen. A better method could be to render an interactive graph which can update the HSP table and corresponding layout.

Space in file name causing failure for workflow when loading genome.

CoGe's reported error:

######################### Sorting FASTA file Job: /opt/apache2/coge/scripts/sort_fasta.pl Pa_scaffold 0_7.fa > Pa_scaffold 0_7.fa.sorted Submitted at: 02/09/20 at 01:19:13AM Completed at: 02/09/20 at 01:19:14AM Execution time: 0.026153 sec Assigned to task: 16089 Return status: 2 Expected outputs: /scratch/coge/tmp/staging/twoehner85/272865/Pa_scaffold 0_7.fa.sorted Command Output: "Could not build fai index Pa_scaffold.fai\nCouldn't create index file at /opt/apache2/coge/scripts/sort_fasta.pl line

Note: file name is "Pa_scaffold 0_7.fa"

I have cached this file in my data store for debugging purposes.

Dots rendering off of the viewport

Data points are currently not being rendered within the plot. The rendering has been recently changed from rendering svg elements to drawing with a canvas see issue #29. The new canvas needs to be overlaid on the plot.

Add zoom reset to the toolbar

The user may wish to zoom back to the original size. The reset button will pan the user out restoring the view to the full view. This does not restore the user to any saved view or state.

I can'nt upload by jbrowse job

Good morning
I would like to upload my jbrowse job in coge genomeevolutions but I don't have enought options like metadata optionsa and developers mode, so I download CoGe and tried to install the whole programs, modules and components. I would like to know how can I upload my genome and annotation in jbrowse to genome evolution coge.
thanks

Add chromosome labels

The dotplot should display the chromosome name below the region of sequence. Labels need to be able to compress when the number of labels to be displayed is large.

CoGeBlast - lastz fails on some fasta files

Job: /usr/bin/nice --adjustment=10 /usr/local/bin/lastz CoGe_ZUlJXcog.fasta W=8 C=0 K=3000 M=0 O=400 E=30 4.faa > CoGe_ZUlJXcog-31.lastz
Submitted at: 06/09/13 at 01:22:11PM
Completed at: 06/09/13 at 01:22:16PM
Execution time: 0.230413 sec
Assigned to task: 1957
Return status: 256
FAILURE: bad fasta character in 4.faa, >gi|1: Y

jbrowse developers options

Hi, I use coge genome Evolutions, but can I add Hierarchical and Faceted track selectors to jbrowse throught CoGe.
I would like to get something like this:
59812768-bc37b400-92d4-11e9-91ab-2aafa4299fbb
59812755-b17d1f00-92d4-11e9-985d-035cf9b5db42

Add toggle for skipping random/unknown chromosomes

The current plot supports the ability to skip chromosomes labelled as random or unknown. Any points which reference these random or unknown chromosomes should not be rendered when this field is toggled. The ordering and offset will have to be recalculated.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.