ANONYMOUS wrote:
> Hi Michael,
>
> I have read the threads regarding this topic but I am still a bit confused and want to clarify what breaches_per_month uses as input.
>
> Firstly, preprocess takes the input file Cyber_Security_Breaches_noym.tsv as input. This then outputs a cleaned data file to stdout (perhaps using cat clean_data_file.tsv). Please correct me if this is wrong.
>
> Then, breaches_per_month takes a .tsv file as input, (which for testing purposes you will run "preprocess Cyber_Security_Breaches_noym.tsv" and redirect the output to breaches_per_month). However, we should design breaches_per_month to take an input file (say clean_data_file.tsv) as its argument? If this is not the case, then how will you redirect the stdout so breaches_per_month can accomodate for this.
>
> Cheers
Hi,
That is precisely what I intend. The first arugment for both scripts is a file. In the first case it's a data file that needs to be clean, but it's up to the user to specify what that file is. Similarly, breaches_per_month takes an input file which you can assume to contain cleaned data, but, again, it is up to the user as to what file that will be.
Makes sense?
Cheers
MichaelW