You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: presentation/hybseq_course.tex
+11-14Lines changed: 11 additions & 14 deletions
Original file line number
Diff line number
Diff line change
@@ -110,7 +110,7 @@ \section{Introduction}
110
110
\begin{itemize}
111
111
\item Course \href{https://github.com/V-Z/hybseq-course}{Git (slides with all links)} and \href{https://is.cuni.cz/studium/eng/predmety/index.php?do=predmet&kod=MB120C117}{information in SIS} (\href{https://is.cuni.cz/studium/predmety/index.php?do=predmet&kod=MB120C117}{česky})
112
112
\item\alert{Scripts} \url{https://github.com/V-Z/hybseq-scripts} \textbf{shown in the course} --- clone the Git repository and adapt to your needs
113
-
\item Download presentation from \url{https://trapa.cz/en/hybseq-course-2024}
113
+
\item Download presentation from \url{https://trapa.cz/en/hybseq-course-2025}
114
114
\item Most of the work is done in Linux/UNIX (macOS,~\ldots) command line, so that good knowledge of work in command line is essential, good starting point can be my Linux and MetaCentrum course \url{https://soubory.trapa.cz/linuxcourse/}
115
115
\item Many tasks are done in R, so that at least basic knowledge of R is needed, good starting point can be my R course \url{https://soubory.trapa.cz/rcourse/}
116
116
\item Processing HybSeq data is computationally demanding (it requires plenty of resources), during the course we use \href{https://www.metacentrum.cz/en/Sluzby/Grid/}{MetaCentrum, Czech National Grid Infrastructure} (\href{https://www.metacentrum.cz/cs/Sluzby/Grid/}{česky}) (slide~\ref{CESNET}), but any computing cluster or powerful desktop (for patient users;-) can be used
@@ -216,7 +216,7 @@ \subsection{Software needed}
216
216
\item\href{https://blast.ncbi.nlm.nih.gov/doc/blast-help/downloadblastdata.html\#downloadblastdata}{BLAST+} (used by \href{https://github.com/mossmatters/HybPiper/wiki}{HybPiper})
217
217
\item\href{https://github.com/lh3/bwa}{BWA} (used by \href{https://github.com/mossmatters/HybPiper/wiki}{HybPiper})
218
218
\item\href{https://www.wsi.uni-tuebingen.de/lehrstuehle/algorithms-in-bioinformatics/software/dendroscope/}{Dendroscope} --- visualize outputs of \href{https://phylogenomics.rice.edu/html/phylonet.html}{PhyloNet}
219
-
\item\href{https://www.ebi.ac.uk/about/vertebrate-genomics/software/exonerate}{Exonerate} (used by \href{https://github.com/mossmatters/HybPiper/wiki}{HybPiper})
219
+
\item\href{https://github.com/nathanweeks/exonerate}{Exonerate} (used by \href{https://github.com/mossmatters/HybPiper/wiki}{HybPiper})
220
220
\item\href{https://www.gnu.org/software/parallel/}{GNU Parallel} (used by \href{https://github.com/mossmatters/HybPiper/wiki}{HybPiper} and in BASH scripts)
221
221
\item\href{https://github.com/mossmatters/HybPiper/wiki}{HybPiper} --- recovering genes from targeted sequence capture data
222
222
\item\href{http://www.iqtree.org/}{IQ-TREE} --- gene trees
\item Information about data storage \url{https://du.cesnet.cz/en/start} (\href{https://du.cesnet.cz/cs/start}{česky}) contains detailed usage instructions
253
253
\item Information about MetaCentrum \url{https://www.metacentrum.cz/en/} (\href{https://www.metacentrum.cz/cs/}{česky})
254
254
\item Most of practical information for users are at \url{https://docs.metacentrum.cz/}
255
-
\begin{itemize}
256
-
\item Old \href{https://wiki.metacentrum.cz/}{wiki} is deprecated
257
-
\end{itemize}
258
-
\item To start work see at least \href{https://docs.metacentrum.cz/access/log-in/}{access}, \href{https://docs.metacentrum.cz/computing/}{computing}, \href{https://docs.metacentrum.cz/data/data-within/}{work with data} and some \href{https://docs.metacentrum.cz/tutorials/}{tutorial}
255
+
\item To start work see at least \href{https://docs.metacentrum.cz/en/docs/access/log-in}{access}, \href{https://docs.metacentrum.cz/en/docs/computing/concepts}{computing}, \href{https://docs.metacentrum.cz/en/docs/data/large-data}{work with data} and some \href{https://docs.metacentrum.cz/en/docs/tutorials}{tutorial}
259
256
\item Of course, good knowledge of work in Linux command line (BASH) is needed\ldots
\item Current state and usage as available at \url{https://metavo.metacentrum.cz/}
276
273
\item Manage your user account at \url{http://metavo.metacentrum.cz/en/myaccount/} (\href{https://metavo.metacentrum.cz/cs/myaccount/}{česky})
277
274
\item Personal view on actual resources and running tasks is at \url{https://metavo.metacentrum.cz/pbsmon2/person}
278
-
\item Listing of available applications \url{https://docs.metacentrum.cz/software/search-soft/}
279
-
\item It has several \href{https://docs.metacentrum.cz/access/log-in/}{front ends} where users log, various \href{https://docs.metacentrum.cz/data/data-within/}{storages}, and thousands of computers (nodes) doing the calculations --- they are not accessed directly to run task
275
+
\item Listing of available applications \url{https://docs.metacentrum.cz/en/docs/software/alphabet}
276
+
\item It has several \href{https://docs.metacentrum.cz/en/docs/access/log-in}{front ends} where users log, various \href{https://docs.metacentrum.cz/en/docs/data/large-data}{storages}, and thousands of computers (nodes) doing the calculations --- they are not accessed directly to run task
280
277
\begin{itemize}
281
278
\item Distributed nature and number of front ends and storages may be confusing for beginners
\item Personal view \url{https://metavo.metacentrum.cz/pbsmon2/person} has nice overview of available resources and tasks and allows comfortable construction of submission command
\item Tasks (BASH scripts) are submitted for computing by \texttt{qsub} --- the script must copy the data into \texttt{\$SCRATCHDIR} and do all calculations there
318
315
\begin{itemize}
319
-
\item It has plenty of options how to specify requirements (see \href{https://docs.metacentrum.cz/computing/run-basic-job/}{help})
316
+
\item It has plenty of options how to specify requirements (see \href{https://docs.metacentrum.cz/en/docs/computing/run-basic-job}{help})
320
317
\end{itemize}
321
318
\item Queued and running jobs can be seen by \texttt{qstat -u \$USER} (\texttt{qstat} has much more options) and any job can be terminated by \texttt{qdel 123456789} (number from \texttt{qstat})
322
319
\end{itemize}
@@ -419,7 +416,7 @@ \subsection{Data download and start}
419
416
\begin{frame}[fragile]{Data download}
420
417
\label{datadownload}
421
418
\begin{itemize}
422
-
\item MetaCentrum storages have sometimes too limited quota for such a large data --- see your \href{https://metavo.metacentrum.cz/en/myaccount/kvoty}{quotas} (\href{http://metavo.metacentrum.cz/cs/myaccount/kvoty}{česky}), see \href{https://docs.metacentrum.cz/data/quotas/}{details}
419
+
\item MetaCentrum storages have sometimes too limited quota for such a large data --- see your \href{https://metavo.metacentrum.cz/en/myaccount/kvoty}{quotas} (\href{http://metavo.metacentrum.cz/cs/myaccount/kvoty}{česky}), see \href{https://docs.metacentrum.cz/en/docs/data/quotas}{details}
423
420
\item The \href{https://github.com/V-Z/hybseq-scripts}{pipeline} produce a lot of data (especially \href{https://github.com/mossmatters/HybPiper/}{HybPiper}; gene trees can be also large) --- ensure to have enough space to store everything
424
421
\begin{itemize}
425
422
\item Especially \textbf{HybPiper produces a lot of files} --- \alert{user may reach quota for number of files}, not necessarily (only) amount of data
0 commit comments