Comet release 2019.01
Documentation for parameters for release 2019.01
can be found here.
release 2019.01 rev. 5 (2019.01.5), release date 2020/04/06
- Bug fix: a bug in the code could randomly set the "minimum_peaks"
parameter value to a random large number, causing no peaks to be read
in, resulting in blank search results reported. Thanks to C. Bielow
for not only reporting the bug but also debugging the code and
implementing the fix.
- Known bug: Nucleotide database searches will fail. This bug was
introduced in version 2019.01.2. A fix has been implemented; the 2019.01.5
release binaries were updated with this fix on 2020/05/28.
Thanks to N. Tay for reporting the issue.
- Known bug: Searches of mzML and mzXML files without scan index do
not work under Windows. A fix has been implemented; the 2019.01.5
release binaries were updated with this fix on 2020/05/28.
Thanks to R. Marrisen for reporting the issue.
release 2019.01 rev. 4 (2019.01.4), release date 2020/01/15
- Add support for searching mzML and mzXML files that do not contain the
options scan index. Such files would previously have not been searched as
MSToolkit would throw an error message about the missing index.
- Bug fix: PEFF substitutions on flanking residues, generating a new
peptide that would otherwise not be analyzed, are treated more rigorously.
release 2019.01 rev. 3 (2019.01.3), release date 2019/12/06
- Bug fix: Searches against mzML files would report the wrong scan number.
The issue was due to the MSToolkit update for release 2019.01.0. Using the
latest MSToolkit commit 2021e7e from 12/3/19 fixes this error. Thanks to
Z. Sun for reporting the bug.
release 2019.01 rev. 2 (2019.01.2), release date 2019/11/18
- Bug fix: introduced in the 2019.01 rev. 0 release, Comet would not properly handle
a "clip_nterm_methionine_"
search. This bug would manifest as either a segmentation fault or as a NULL character
reported for a flanking residue due to not properly tracking the shortened
protein length when the start methione is clipped off. Thanks to the
Villen Lab and R. Johnson for reporting the issue.
release 2019.01 rev. 1 (2019.01.1), release date 2019/09/06
- Known bug: a NULL character can show up as the flanking residue
(peptide_next_aa attribute in pep.xml output) for an internal decoy match.
- In Percolator .pin output, change ExpMass and CalcMass from neutral
masses to singly protonated masses. Thanks to W. Fondrie and the
Crux team for reporting that Percolator expects these masses to be
singly protonated.
- Bug fix: correct missing residue in StaticMod header entry of SQT
output. Thanks to A. Zelter for reporting the issue.
- Bug fix: update database indexing and index search to correctly
handle terminal variable modifications. Thanks to T. Zhao for
reporting the issue.
release 2019.01 rev. 0 (2019.01.0), release date 2019/08/19
- Known bug: database indexing fails with variable modifications; particularly with
variable terminal mods. See the release_2019011 branch which contains a fix.
- Add support for user specified fragment neutral loss ions
(such as phosphate neutral loss) via the addition of an 8th field to
the "variable_mod01 to variable_mod09" parameters.
- Add support for user specified precursor neutral loss ions
using the "precursor_NL_ions" parameter.
- Add "max_duplicate_proteins"
parameter which can limit the number of protein
identifiers reported/returned for a given peptide.
- Add "peptide_length_range" parameter.
This parameter allows the specification of a minimum and maximum peptide length.
- Add "search_enzyme2_number"
parameter. Allows optional selection of a second digestion enzyme.
Enzyme specificity
and missed cleavage
settings are are shared between both
"search_enzyme_number"
and "search_enzyme2_number".
- Update "max_variable_mods_in_peptide"
parameter to support value 0.
- In the example comet.params files available here
and the when generated using "comet -p", the
"spectrum_batch_size"
parameter is now set to 15000 instead of 0.
For high-res "fragment_bin_tol"
settings, Comet should use less than 6GB of memory with a 15K batch size with very little
impact on search times compared to loading and searching all spectra at once.
If you have a computer with lots of ram, feel free to set this parameter value to 0 to gain
a bit of search speed. Note that when this parameter is missing, its default value is 0.
- Added indexed database support. This was implemented to support real-time
searching. A database index can be created with the command "comet.exe -i" which
will create an indexed database of the database specified in the comet.params file.
Static and variable modifications specified in the comet.params file will be
stored in the index database. Note an indexed database (with .idx extension) can
be searched if you specify the indexed database file as the search database
("database_name").
Searching an indexed database only uses a single thread so the
"num_threads"
parameter is ignored if an indexed database is specified.
- Support real-time searching of single spectra against an indexed database
with a C# interface for CometWrapper.dll.
This includes an example C# program (RealtimeSearch/Search.cs) that loops through all
scans of a Thermo raw file using Thermo's RawFileReader and sequentially calling
DoSingleSpectrumSearch() to run the search.
Thanks to D. Schweppe and the Gygi lab at Harvard and D. Bailey at Thermo for
contributions to this project. Some
notes on real-time search support are available here.
- Update MSToolkit from b41b33f commit on 1/30/19
- Bug fix: error with logic in analysis in target-decoy searches where
erroneous target and decoy fragment ions could be generated.
- Bug fix: add support for parsing '*' in the sequence which was removed in the
2018.01 rev. 0 release. This character is treated as a stop codon; residues
flanking this character are valid search enzyme termini.
- Features that didn't make the cut and are targeted for the next release:
mzIdentML output and Comet-PTM functionality.
Go download from the download page.