[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: demos.png (185 KB, 900x1223)
185 KB
185 KB PNG
I've set up a bioinformatics toolchain that I run on my arch linux machine.

The workflow is managed by two shell scripts, one does a fetch, and is called by the other that runs the tools.
you input the organism name in the command, then it runs for a long time and outputs reference genomes, annotations, filtered protein sequences, designed backbones + sequences + predicted structures, validation results, logs, and a README summary
The steps it goes through and the tools are:
Genome download (RefSeq preferred to GenBank to clustered fallback) + header cleanup
Quality control (adapter trimming, basic stats) BBTools (bbduk)
Repeat masking RepeatModeler, RepeatMasker
Gene prediction: BRAKER3 for eukaryotes (OrthoDB + optional RNA hints) or Prokka for prokaryotes
Protein extraction/filtering (longest isoform, ≥100 aa prok / >200 aa euk) gffread and biopython
Backbone generation with RFdiffusion (de novo or motif-scaffolded)
Sequence design using ProteinMPNN (8 variants per backbone)
Batch structure prediction with ColabFold (AlphaFold2)
Remote BLASTp validation against nr (top hit recorded)
Functional renaming based on BLAST inference
Markdown report
(managed in Conda envs)
Also plasmid design with MoClo, domestication and codon specificity for organisms.
The toolchain automates the entire process from a name through reference genome retrieval, quality control and annotation, protein extraction, AI-driven de novo protein design (backbones and sequences), structure prediction, BLAST-based validation, functional renaming, and generation of a clear Markdown report. Also an electron gui for directing other programs to visualize the output, making queries in a search bar, and a fallback 3dmol.js viewer.
>>
so you are basically God now



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.