simplifying code and adding resources.yml #10

antgonza · 2025-10-28T19:12:59Z

No description provided.

lucaspatel

Some minor concerns about testing but otherwise looks good to me.

lucaspatel · 2025-11-10T20:35:50Z

qp_pacbio/data/templates/2.get-circular-genomes.sbatch

+cd ../step-2/${sample_name}_split
+# making a copy of the small_LCG before they are removed
+mkdir -p {{output}}/step-2/${sample_name}_small_LCG
+find . -maxdepth 1 -type f -size -512k -print0 | xargs -0 -r cp -t ../${sample_name}_small_LCG


So small_LCG is defined as files smaller than 512 KB? That is probably important to note in the documentation for the PacBio workflow.

I was expecting small_LCG to be defined by total genome size.

@jianshu93, can you comment?

By file size for now. This can be optimized, but file sizes are proportional to total genome size.

That is approximately 515,000 bases (about half a million), because one character takes approximately one byte.
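To make the two criteria in this thread concrete, here is a minimal Python sketch that compares on-disk file size with the total number of bases in a FASTA file. It is not code from this PR: the function names are hypothetical, and the 512 * 1024 byte threshold is an assumed stand-in for the template's `find -size -512k` test, whose rounding behaves slightly differently.

```python
import os

# Assumed stand-in for the template's `find -size -512k` cutoff.
SIZE_THRESHOLD_BYTES = 512 * 1024


def total_bases(fasta_path):
    """Count sequence characters, skipping header lines and line endings."""
    count = 0
    with open(fasta_path) as handle:
        for line in handle:
            if not line.startswith(">"):
                count += len(line.strip())
    return count


def is_small_by_file_size(fasta_path, threshold=SIZE_THRESHOLD_BYTES):
    """Approximate the current criterion: on-disk size below a threshold."""
    return os.path.getsize(fasta_path) < threshold


def is_small_by_genome_size(fasta_path, threshold=SIZE_THRESHOLD_BYTES):
    """Alternative criterion: total bases below a threshold."""
    return total_bases(fasta_path) < threshold
```

File size slightly overestimates the base count because header lines and newlines also take up bytes, so the two checks can disagree for files close to the threshold.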

lucaspatel · 2025-11-10T20:45:57Z

qp_pacbio/scripts.py

+            else:
+                full = full.concat(loaded)
+
+            with h5py.File(f"{base}/{rank}", "w") as out:


Not sure if I'm reading this right, but wouldn't these lines here indicate that you will write full to out on every iteration of the loop, thus rewriting the same path over and over?
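For reference, the pattern this comment points toward can be sketched as accumulate first, then write once. Plain lists and JSON stand in here for the BIOM tables and HDF5 output used in the PR; `merge_and_write` and its arguments are hypothetical names, and the real loop structure in scripts.py may differ from what the diff excerpt suggests.

```python
import json


def merge_and_write(chunks, out_path):
    """Concatenate all chunks first, then write the result a single time."""
    full = None
    for loaded in chunks:
        if full is None:
            full = list(loaded)
        else:
            full = full + list(loaded)

    # The write sits outside the loop, so out_path is opened only once.
    if full is not None:
        with open(out_path, "w") as out:
            json.dump(full, out)
    return full
```

If the write stayed inside the loop, the final file would hold the same merged content, but every iteration would repeat the I/O for a result that is immediately overwritten.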

lucaspatel · 2025-11-10T20:46:26Z

qp_pacbio/scripts.py

+
+@click.command()
+@click.option("--base", type=click.Path(exists=True), required=True)
+def biom_merge(base):


This seems like a lot of logic to merge some BIOM. Is this borrowed from qp-woltka? Do you have tests supporting this function in particular?

lucaspatel · 2025-11-10T20:47:15Z

pyproject.toml

    "qiita-files@https://github.com/qiita-spots/qiita-files/archive/master.zip",
    "qiita_client@https://github.com/qiita-spots/qiita_client/archive/master.zip",
+    "woltka@git+https://github.com/qiyunzhu/woltka.git#egg=woltka",
+    "micov@git+https://github.com/biocore/micov.git#egg=micov",


Minor, but you can probably get micov from pip now.

It would be nice if we could install from bioconda.

jianshu93

I like the database option. We will probably update it very soon, to wol3 or even GTDB.

simplifying code and adding resources.yml

9464d2b

antgonza changed the title ~~simplifying code and adding resources.yml~~ [WIP]: simplifying code and adding resources.yml Oct 28, 2025

antgonza added 28 commits October 28, 2025 13:18

export ENVIRONMEN sooner in the script

a6ffcc5

add ENVIRONMENT in qp-pacbio yml

b371545

afterok -> afterany

a2077b0

CONDA_ENVIRONMENT

77fc60e

fix tests

6e05385

CONDA_ENVIRONMENT

766bec5

-J me_

b6032b8

adding missing params for merge

ffdae88

data

43a931f

mv data to qp_pacbio

5cf7de8

find_base_path

d11ba2a

--ignore=qp_pacbio/data

8e6267a

"results": result_fp,

0071e1c

results -> result_fp

80373a0

add completed

50a46cf

output -> out_dir

9a44417

SLURM_ARRAY_JOB_ID->SLURM_ARRAY_TASK_ID

fb42f44

rm extra hifiasm_meta

bdb82df

validate failed_steps

0cd35e7

rm shopt

a58a1f2

add file check FILES=(*.fa)

d2b227c

save small LCGs

741e242

update databases

52d76b2

forgot 1 update

6af6c17

rm extra /

f98217e

update minimap2 woltka command

af437dd

nprocs -> 16

b9b5c76

biom_merge_pacbio

e58bb16

antgonza added 9 commits November 5, 2025 19:43

woltka & biom

9098888

micov

15bbe41

pip https -> git

a2775aa

90 ->150

c6193a5

fix test

23fdf69

readd lcg_folder

9d7fcb8

add finish_qp_pacbio to woltka

e82c2f3

missing new line

2286019

_small_LCGs -> _small_LCG

813d03b

antgonza changed the title ~~[WIP]: simplifying code and adding resources.yml~~ simplifying code and adding resources.yml Nov 7, 2025

antgonza requested review from jianshu93 and lucaspatel November 7, 2025 14:14

antgonza added 2 commits November 10, 2025 09:18

09 -> 11 and default_params_set

4a1fd74

default params should be a dict

20be910

lucaspatel reviewed Nov 10, 2025

jianshu93 approved these changes Nov 10, 2025

antgonza added 3 commits November 11, 2025 13:03

fixes after more testing

75344ca

fix tests

f694914

rm >

b203403

lucaspatel Nov 10, 2025

lucaspatel Nov 10, 2025

antgonza Nov 10, 2025

jianshu93 Nov 11, 2025

jianshu93 Nov 11, 2025

lucaspatel Nov 10, 2025

lucaspatel Nov 10, 2025

lucaspatel Nov 10, 2025

jianshu93 Nov 10, 2025

4 participants
