
Comments (13)

nextgenusfs commented on July 20, 2024

So in the error output, is mockIM4p82redo the option passed to -b, or is it mockIM4 as you wrote in the command before the error message? Maybe that combination of options is causing the error. Give me a few minutes to take a look.

from amptk.

devonorourke commented on July 20, 2024

It's kind of odd. Because our compute cluster uses SLURM to submit jobs, I generate two .log files. The .amptk-filter.log file doesn't show any error, while the SLURM-generated .log file is the one that's throwing the Python error.
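One plausible explanation for the split between the two logs, sketched below: a tool's own log file only receives messages the tool explicitly writes to it, while an uncaught Python exception is printed to stderr, which the scheduler captures in its own output file. This is a minimal illustration, not amptk's actual logging setup; the logger name `demo_tool`, the `run_step` function, and the two in-memory buffers standing in for the log files are all hypothetical.

```python
import io
import logging
import traceback

# Stand-ins for the two files (hypothetical): the tool's own log and
# the stderr stream that a scheduler would capture.
tool_log = io.StringIO()
scheduler_log = io.StringIO()

# The tool only writes what it explicitly logs to its own log file.
logger = logging.getLogger("demo_tool")
logger.setLevel(logging.INFO)
logger.propagate = False
logger.addHandler(logging.StreamHandler(tool_log))


def run_step(table, key):
    """Log progress, then look up a key that may be missing."""
    logger.info("Subtracting from table")
    return table[key]


try:
    run_step({"sampleA": 1}, "mockIM4p82redo")
except KeyError:
    # The interpreter would print an uncaught traceback to stderr;
    # here it is written to the stand-in buffer instead.
    traceback.print_exc(file=scheduler_log)

print("tool log:", tool_log.getvalue().strip())
print("scheduler log mentions KeyError:", "KeyError" in scheduler_log.getvalue())
```

In this sketch the tool log contains only the progress message, and the traceback appears only in the stream the scheduler would capture.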

If I try to run the script with just the -b mockIM4p82redo flag (and no --mc info) the operation fails and the .amptk-filter.log generates an error stating that I have to include the --mc file.

So I have discovered how to make errors all over the place, ha!

I've also tried changing the --subtract flag to auto instead of an integer, but the same Python error is thrown.


nextgenusfs commented on July 20, 2024

So the SLURM job is erring out and not amptk filter?


devonorourke commented on July 20, 2024

Sorry. It appears that the information being generated in the SLURM log file is specific to the output from amptk... here's the whole shebang:

      /\^._.^/\     starting process: Sun Jan 28 15:46:55 EST 2018
[03:46:56 PM]: OS: linux2, 24 cores, ~ 131 GB RAM. Python: 2.7.14
[03:46:57 PM]: AMPtk v1.0.3, USEARCH v9.2.64, VSEARCH v2.6.2
[03:46:57 PM]: Loading OTU table: ../dropd.cluster.otu_table.txt
[03:46:57 PM]: OTU table contains 953 OTUs and 9,335,697 read counts
[03:46:57 PM]: Mapping OTUs to Mock Community (USEARCH)
[03:46:57 PM]: Sorting OTU table naturally
[03:46:57 PM]: Removing OTUs according to --min_reads_otu: (OTUs with less than 2 reads from all samples)
[03:46:57 PM]: Found 10 mock chimeras (only in mock sample and not mapped to mock sequences) excluding from index-bleed calculation
[03:46:57 PM]: Index bleed, mock into samples: 8.307496%.  Index bleed, samples into mock: 0.050620%.
[03:46:57 PM]: Overwriting auto detect index-bleed, setting to 1.000000%
[03:46:57 PM]: Auto subtract filter set to 433
[03:46:57 PM]: Subtracting 433 from OTU table
-------------------------------------------------------
Traceback (most recent call last):
  File "/mnt/lustre/macmaneslab/devon/pkgs/amptk-1.0.3/bin/amptk-filter.py", line 484, in <module>
    mocks = final[args.mock_barcode]
  File "/mnt/lustre/software/linuxbrew/colsa/Cellar/python/2.7.14/lib/python2.7/site-packages/pandas/core/frame.py", line 2139, in __getitem__
    return self._getitem_column(key)
  File "/mnt/lustre/software/linuxbrew/colsa/Cellar/python/2.7.14/lib/python2.7/site-packages/pandas/core/frame.py", line 2146, in _getitem_column
    return self._get_item_cache(key)
  File "/mnt/lustre/software/linuxbrew/colsa/Cellar/python/2.7.14/lib/python2.7/site-packages/pandas/core/generic.py", line 1842, in _get_item_cache
    values = self._data.get(item)
  File "/mnt/lustre/software/linuxbrew/colsa/Cellar/python/2.7.14/lib/python2.7/site-packages/pandas/core/internals.py", line 3838, in get
    loc = self.items.get_loc(item)
  File "/mnt/lustre/software/linuxbrew/colsa/Cellar/python/2.7.14/lib/python2.7/site-packages/pandas/core/indexes/base.py", line 2524, in get_loc
    return self._engine.get_loc(self._maybe_cast_indexer(key))
  File "pandas/_libs/index.pyx", line 117, in pandas._libs.index.IndexEngine.get_loc
  File "pandas/_libs/index.pyx", line 139, in pandas._libs.index.IndexEngine.get_loc
  File "pandas/_libs/hashtable_class_helper.pxi", line 1265, in pandas._libs.hashtable.PyObjectHashTable.get_item
  File "pandas/_libs/hashtable_class_helper.pxi", line 1273, in pandas._libs.hashtable.PyObjectHashTable.get_item
KeyError: 'mockIM4p82redo'
      /\^._.^/\     ending process: Sun Jan 28 15:46:57 EST 2018

So from what I can tell, the process stops once amptk begins to subtract the values from the OTU table, with the trouble having something to do with the mock community name.
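For anyone reading the traceback above: the final KeyError means the DataFrame named final had no column labeled mockIM4p82redo at the moment line 484 indexed it. The sketch below reproduces that class of failure on a made-up table and shows one defensive lookup. The table contents, the step that drops the column, and the helper `get_sample_column` are hypothetical, and this is not a reconstruction of the change made in amptk.

```python
import pandas as pd

# Hypothetical OTU table: rows are OTUs, columns are sample names.
otu_table = pd.DataFrame(
    {"sampleA": [500, 3, 0], "mockIM4p82redo": [0, 900, 12]},
    index=["OTU1", "OTU2", "OTU3"],
)

# Assume an earlier filtering step left the mock column out.
final = otu_table[["sampleA"]]


def get_sample_column(table, name):
    """Return one sample column, or None if the column is absent."""
    if name not in table.columns:
        return None
    return table[name]


# Direct indexing raises KeyError when the column label is missing.
try:
    final["mockIM4p82redo"]
    raised = False
except KeyError:
    raised = True

print("direct lookup raised KeyError:", raised)
print("guarded lookup:", get_sample_column(final, "mockIM4p82redo"))
```

Checking `name in table.columns` before indexing lets a script report a readable message instead of a bare traceback.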

I hope you like my computer bats in the log file.


nextgenusfs commented on July 20, 2024

Yeah, okay, I think I was just able to reproduce the same error locally. I'll debug it now.


nextgenusfs commented on July 20, 2024

Well, commit a6b5499 should fix it. I'm not sure why I wrote that in the first place; I think it had something to do with when I was calculating the error rate.


devonorourke commented on July 20, 2024

OK, should I edit the script to delete it myself, or will you push a new version?
Thanks for looking into it so quickly.


nextgenusfs commented on July 20, 2024

If you can edit your copy, that would probably be better; then I won't have to rush a release before it has been run through my tests. I have a few other minor things I wanted to address.


nextgenusfs commented on July 20, 2024

I gather you can see how often I use the subtract filter.....


devonorourke commented on July 20, 2024

Yep, just edited the local version with that change and the script proceeded just fine.

Thank you!


nextgenusfs commented on July 20, 2024

Great, let me know when you run into more problems!


devonorourke commented on July 20, 2024

Yeah, I'm a troublemaker.


nextgenusfs commented on July 20, 2024

Let's leave this issue open in case others have the same problem. I will close it when I do the next release.

