Directory Organization Exercise
Aims
Directory organisation has become more necessary because of the many intermediate and output files that genomics pipelines produce. Which results file corresponds to which sample? To which replicate?
This exercise bundles 16 projects, which are packed, in turn, into 16 zip files.
Data taken from the excellent book "Computational Genomics" by Nello Cristianini and Matthew Hahn (ref. http://www.computational-genomics.net).
Commands
(There are several ways to undertake this task, but this one aims to make use of TAB-COMPLETION keys and HISTORY. This is the streamlined version, not the one followed during the course, as it does not require the "heavy lifting" tools of "find" and "vim".)
- `cd` to ensure we are in our home directory.
- `cp $TCH/allprojs.tar .` to copy the bundle with all the projects to our home directory.
- `tar -tf allprojs.tar` to look inside the tar ("tape archive") file, which contains all the zip files.
- `tar -xf allprojs.tar` to extract them all.
- `ls` to make sure they have been extracted.
- `rm allprojs.tar`, by which we delete the tar file, because we have extracted all its contents.
- `for i in $(ls *.zip); do mkdir ${i%.*}; done` by which we create the directories into which we plan to mv the zip files.
- `for i in $(ls -d */); do cd $i; unzip ${i%/*}.zip; cd ..; done` by which we enter each directory in turn and unzip the zip file of the same name, which must already have been moved there.
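The two loops above rely on the shell's suffix-stripping expansions, `${i%.*}` and `${i%/*}`. The following is a minimal sketch of how they behave, not the course's own script: the scratch directory and the names `projA.zip` and `projB.zip` are invented for illustration, and the files are empty placeholders rather than real zip archives, so nothing is actually unzipped. It also iterates over plain globs (`*.zip`, `*/`) instead of `$(ls ...)`, which is one way to stay safe if a file name ever contains spaces.

```shell
#!/usr/bin/env bash
# Sketch only: shows the suffix-stripping expansions on placeholder files.
set -euo pipefail

# Hypothetical scratch directory and project names, invented for this demo.
demo=$(mktemp -d)
cd "$demo"
touch projA.zip projB.zip    # empty placeholders, not real zip archives

# ${i%.*} removes the shortest trailing match of ".*", i.e. the extension.
for i in *.zip; do
    mkdir "${i%.*}"          # projA.zip -> directory projA
    mv "$i" "${i%.*}/"       # move each archive into its own directory
done

# ${i%/*} removes the shortest trailing match of "/*", i.e. the final slash.
for i in */; do
    echo "${i%/*} contains: $(ls "$i")"
done
```

Each placeholder ends up inside a directory named after it, which is the layout the unzip loop expects before it is run.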