Skip to main content

Search

Items tagged with: Bioinformatics


New tool from me: I wrote a local aligner that can do approximate reference-based alignment and find genes in a query microbial genome, without a separate indexing step or any temporary disk space usage.

Written fully in Rust and there will be a browser version out later: https://github.com/tmaklin/kbo 1/3

#rust #rustlang #bioinformatics


[New paper]: Two subtle problems with over-representation analysis.
ORA is a type of enrichment analysis that analyses over-represented functional categories in gene lists. These tools have accumulated ~190k citations, but they have subtly different behaviours. Here we unpack the differences and investigate two subtle problems in some implementations, which may have negatively impacted those 190k research papers.
https://doi.org/10.1093/bioadv/vbae159
#genomics #bioinformatics


Our new preprint on the impact of protein phosphorylation on structures is now out!

Protein phosphorylation is a key regulator of cellular processes.It's ubiquitous, yet the function and relevance of most phosphosites is unknown. Understanding phosphosite function requires elucidating the structural mechanisms through which it acts. (1)

https://www.biorxiv.org/content/10.1101/2024.10.18.617420v1?med=mas

@bioinformatics @strucbio #bioinformatics #protein #StructuralBiology


Profile and Binomica Labs registered on archaea.bio!

I'll be doing a lot more Halobacteria & Halococcus work in the near future, looking forward to sharing some interesting notes

https://www.archaea.bio/profiles/sung-won-lim

#microbiology #archaea #bioinformatics


The SplitsTree App: interactive analysis and visualization using phylogenetic trees and networks

https://www.nature.com/articles/s41592-024-02406-3

#evolution #bioinformatics


PhD Student in Virology/Bionformatics

Otto von Guericke Universität Magdeburg

The project will investigate genomic features in DVG and DI formation in #enteroviruses using #molecular biology and #bioinformatics.

See the full job description on jobRxiv: https://jobrxiv.org/job/otto-von-guericke-universitat-magdeburg-27778-phd-student-in-virology-bionformatics/?feed_id=...
https://jobrxiv.org/job/otto-von-guericke-universitat-magdeburg-27778-phd-student-in-virology-bionformatics/?feed_id=81269


Funded PhD Position in metagenomics of environmental films

Texas Tech University

Exciting funded PhD opportunity studying metagenomics of environmental films

See the full job description on jobRxiv: https://jobrxiv.org/job/texas-tech-university-27778-funded-phd-position-in-metagenomics-of-environmental-films/?feed_id=79057

#biofilms #bioinformatics #biology #metag...
https://jobrxiv.org/job/texas-tech-university-27778-funded-phd-position-in-metagenomics-of-environmental-films/?feed_id=79057


1) Want to know how much of your metagenome is eukaryotic? No references? No problem. We developed SingleM microbial fraction (SMF) and ran it on 250k metagenomes https://www.biorxiv.org/content/10.1101/2024.05.16.594470v1.

If you know what Eukaryotes are there, you can filter reads by mapping to their genomes. However, often you don’t know what’s in your sample, or the euk doesn’t have a genome.

#metagenomics #bioinformatics #genomics #microbiomes #microbialecology


"Indexing All Life's Known Biological #Sequences" https://www.biorxiv.org/content/10.1101/2020.10.01.322164v3

"In this work, we take advantage of recently developed, very efficient data structures and algorithms for representing sequence sets. We make Petabases of DNA sequences across all clades of life, including viruses, bacteria, fungi, plants, animals, and humans, fully searchable and make the indexes available to the research community"

#bioinformatics #database


Remembering the first Canadian #Bioinformatics Workshop: #CBW1999 … we used to use mouse pads back then! #25YearsofCBW This was a useful one for those didn’t know their standard genetic code by heart #oldbioinformaticsthings


Hello sugar people of the :fediverse: . My former collegue Bernard Henrissat now in 🇩🇰 is looking for a #PhD student in a Marie Sklodowska-Curie 🇪🇺 training network to work with him at the Technical University of Denmark. More info:
https://euraxess.ec.europa.eu/jobs/201030
The deadline for application is May 31st, 2024 and the job will start in November.

Skills desired: #bioinformatics, general biology (++ #carbohydrates) and fluency in 🇬🇧.
Eligibility: the candidate should not have worked in 🇩🇰 before.


Bioinformaticians are impatient👇
Conda is notorious slow... then I found mamba https://github.com/mamba-org/mamba, then I found
pixi https://prefix.dev/blog/pixi_a_fast_conda_alternative : Blazing fast cross-platform package management for teams.
By the creators of the mamba package manager. #bioinformatics


There's a new minimap2 release with ONT specific accurate long reads setting. Looking forward to giving it a test run!

https://github.com/lh3/minimap2/releases/tag/v2.27

#bioinformatics #microbiology


[New preprint] Direction-aware functional class scoring enrichment analysis of Infinium DNA methylation data
#Genomics #Epigenetics #Bioinformatics
In this article we outline a refined method for pathway enrichment of infinium array data that is more sensitive and precise as compared to existing over-representation approaches. Feedback welcome.
https://www.biorxiv.org/content/10.1101/2024.02.22.581670v1


Looking for some suggestions - I'm working with Archaeal genomic regions whose optimally fitting model seems to be Tamura-Nei (https://doi.org/10.1093/oxfordjournals.molbev.a040023).

Would it be too much to suspect the Archaeal region is under similar types of evolutionary pressure?

#archaea #microbiology #evolution #phylogenetics #bioinformatics


Large language models improve annotation of prokaryotic viral proteins

https://www.nature.com/articles/s41564-023-01584-8

#virology #viruses #bioinformatics #genomics


Scalable, accessible and reproducible reference genome assembly and evaluation in Galaxy

https://www.nature.com/articles/s41587-023-02100-3

#genomics #bioinformatics


Gotta check these out!

Robust, scalable, and informative clustering for diverse biological networks

https://genomebiology.biomedcentral.com/articles/10.1186/s13059-023-03062-0

#bioinformatics #genomics #genetics #statistics


Stowers is starting up a paid training program for bioinformatics, please share with anyone who might be interested in something like that! (in Kansas City)

https://www.stowers.org/gradschool/computational-biology-scholars

#bioinformatics


Contrasting drivers of abundant phage and prokaryotic communities revealed in diverse coastal ecosystems
https://doi.org/10.1038/s43705-023-00333-6
#microbiology #bioinformatics


If anyone is looking for the WGCNA tutorials while the Horvath's lab website has been down for a while, there's a Dropbox link with all the tutorials here: https://bioinformatics.stackexchange.com/a/21886

#bioinformatics


At my workplace, we're looking into how we can support the processing of very large datasets in R. It would be wonderful if some bioinformaticians could answer a couple of questions to direct us to the problem points.

We're hoping that we can publish something out of this that will be helpful to everyone in the field: https://forms.office.com/r/jNd2cbEZkh

#rstats #bioinformatics


Just FYI, as announced earlier this week, we've stopped using X/Twitter (both our OBF account and BOSC) https://www.open-bio.org/2023/11/20/leaving-x/ - Mastodon users can follow us here or @BOSC for our annual #Bioinformatics #OpenSource conference specifically.


Come work with us! I have a PhD position available to help understand which mutations in the non-coding parts of coding transcripts have functional consequences in Stem Cells.

We will using genome wide mesaurement of the effect of variation in miRNA binding sites to address when vairation causes changes in regulation and when it doesn't.

Mixed wetlab, bioinformatics and statistics project.

https://www.findaphd.com/phds/project/white-rose-bbsrc-dtp-how-can-we-identify-variants-in-untranslated-rna-that-affect-stem-cells/?p160958

#phdPosition #PhD #bioinformatics #UTR #miRNA #genetics


Very happy to share our new publication in PLOS ONE:

KIPEs3: Automatic annotation of biosynthesis pathways
https://doi.org/10.1371/journal.pone.0294342

Excellent work by Andreas Rempel (Bielefeld University) and Nancy Choudhary (@PuckerLab @tubraunschweig #Bioinformatics #OpenAccess


#bioawk is a command-line gem; it’s an extension of awk that auto-assigns variables for BED, SAM, VCF, GFF, and FASTX[AQ] format files, speeding up routine tasks.

For FASTX:
$1:name
$2:seq
$3:qual (FASTQ only)
$4:comment

Found in @vsbuffalo’s great #Bioinformatics Data Skills.


PhylteR: efficient identification of outlier sequences in phylogenomic datasets.

PhylteR can automatically identify sequences likely to be hidden paralogs or horizontally transferred genes in very large datasets. Removing those sequences therefore reduces noise in downstream analyses.

Available as an R package on CRAN or as docker and singularity images.

Package:

https://cran.r-project.org/web/packages/phylter/index.html

Paper:

https://doi.org/10.1093/molbev/msad234

#Phylogeny #Genomics #bioinformatics #Phylogenomics


#PhD positions available in my group for Fall 2024 - projects focusing on #GiantVirus diversity and #evolution. Opportunities for both molecular wet-lab and #bioinformatics research.

The lab is fun, supportive, and inclusive, and we have many cool new viruses we are studying, so come join us!

Please boost and spread the word!

https://jobrxiv.org/job/virginia-tech-27778-funded-phd-position/


Finally, after decades, Excel will let you opt-in to not having your data automatically mangled. A spreadsheet is still not a database.
https://insider.microsoft365.com/en-us/blog/control-data-conversions-in-excel-for-windows-and-mac
#excel #dataanalysis #research #bioinformatics


Registrations are now open for the 2nd edition of the course "Analysis of Prokaryotic Pangenomes" with @jomcinerney & Alan Beavan.

Check it out: https://physalia-courses.org/courses-workshops/prokaritotic-pangenomes/

#Pangenomes #Bioinformatics


We have a PhD vacancy. Structural and functional protein bioinformatics, with a focus on bacteria-phage interactions https://lu.varbi.com/en/what:job/jobID:668695/ #phage #bioinformatics


A new look for the group's website - #Arthropod Evolutionary-Functional #Genomics at http://rmwaterhouse.org 🐞🧬🦟🧬🦋🧬🐝
New Environmental #Bioinformatics Group
@SIB - #DataScience solutions to address environmental challenges https://sib.swiss/environmental-bioinformatics 💻🧬📊🌍🛜🌦️🌄📋🔭


What could go wrong?
#genomics #bioinformatics
https://www.biorxiv.org/content/10.1101/2023.09.08.556814v1


I just found `alv` (https://github.com/arvestad/alv), a tool for viewing multiple sequence alignments on the command line, and it's perfect. I needed a tool to highlight differences and `alv -f fasta -t aa --only-variable aln.afa` does just that. Would prefer the numbering to start at 1 since sequence variants are 1-based #bioinformatics (Data from ERR031940.)


Anyone have advice on how to get the Spades genome assembler to use less memory?

#Spades #bioinformatics

https://github.com/ablab/spades


I wrote a short guide on how to build both alignment-free and reference-based prokaryote phylogenetic trees from SNP alignments without using snippy, check it out

https://www.bacpop.org/guides/building_trees_with_ska/

#bioinformatics #genomics


After months of work I'm so happy that our new preprint is online!

Profiling the expression of #transportome genes in #cancer: a systematic approach.
https://www.biorxiv.org/content/10.1101/2023.07.18.549498v1

It's a bit unfinished but it's by design! We want and need #feedback from #bioinformatics and #physiology people to move forward. I've tried to follow #openscience principles, so the pipeline is completely autonomous, containerized and the code for the paper is there, included in the repos!


If I were to repeat mask a bunch of genomes (closely related) would the best option still be a home made pipeline of RepeatModeler + RepeatMasker these days (basically what I did there https://doi.org/10.1101/2022.09.02.506387)?
Or is there a ready to go solution here?
i.e. give all genomes and get them back soft-masked.
#bioinformatics #genomics

Lo, thar be cookies on this site to keep track of your login. By clicking 'okay', you are CONSENTING to this.