How to delete every two lines after 3rd lines in a file contains very large number of lines? The Next CEO of Stack OverflowHow to print lines number 15 and 25 out of each 50 lines?AWK command failing for large fileextract every nth character from a stringawk manipulationSum of alternate values in a column using either sed or nawkCheck if two lines start with the same character, if so the output average, if not, print actual valuehow to use awk to do subtraction with numbers in a large fileHow to take the values from two columns in a txt file and match them to values in anotherHow to find the min of a column in every nth intervals of a file, using sed, sort, tail?Remove the line if a field of the line exists in another fileHow to aggregate the below records using awk command

Writing differences on a blackboard

Math-accent symbol over parentheses enclosing accented symbol (amsmath)

Flying from Cape Town to England and return to another province

Method for adding error messages to a dictionary given a key

Does increasing your ability score affect your main stat?

Domestic-to-international connection at Orlando (MCO)

Grabbing quick drinks

Can MTA send mail via a relay without being told so?

Rotate a column

Proper way to express "He disappeared them"

What flight has the highest ratio of timezone difference to flight time?

What connection does MS Office have to Netscape Navigator?

How to avoid supervisors with prejudiced views?

Why do remote US companies require working in the US?

Does soap repel water?

How to edit “Name” property in GCI output?

Is it ever safe to open a suspicious HTML file (e.g. email attachment)?

What was the first Unix version to run on a microcomputer?

Can you be charged for obstruction for refusing to answer questions?

Help understanding this unsettling image of Titan, Epimetheus, and Saturn's rings?

Is there always a complete, orthogonal set of unitary matrices?

Why doesn't UK go for the same deal Japan has with EU to resolve Brexit?

Is there a way to save my career from absolute disaster?

How to prove a simple equation?



How to delete every two lines after 3rd lines in a file contains very large number of lines?



The Next CEO of Stack OverflowHow to print lines number 15 and 25 out of each 50 lines?AWK command failing for large fileextract every nth character from a stringawk manipulationSum of alternate values in a column using either sed or nawkCheck if two lines start with the same character, if so the output average, if not, print actual valuehow to use awk to do subtraction with numbers in a large fileHow to take the values from two columns in a txt file and match them to values in anotherHow to find the min of a column in every nth intervals of a file, using sed, sort, tail?Remove the line if a field of the line exists in another fileHow to aggregate the below records using awk command










3
















Like

If I have :



1st line (keep) 
2nd line (keep)
3rd line (keep)
4rth lines (delete)
5th (del)
6th (keep)
7nth (keep)
8th lines (keep)
9th (del)
10th (del)
11th (keep)
12th (keep)
13th (keep)
14th (del)
15th (del)


etc....










share|improve this question









New contributor




Jaguar Jom is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.















  • 1





    increment a line counter (zero-indexed) for each line read, print when (line counter modulo 5>=3)

    – ChuckCottrill
    yesterday











  • can you please clarify more,

    – Jaguar Jom
    yesterday






  • 3





    Possible duplicate of How to print lines number 15 and 25 out of each 50 lines?

    – Sundeep
    21 hours ago






  • 1





    the duplicate is slightly worded differently, but it is the same looked in a different way.. this question would be print lines 1,2,3 out of each 5 lines for ex: seq 15 | awk 'BEGIN a[1] a[2] a[3] ; NR % 5 in a' and seq 15 | sed -n 'p;n;p;n;p;n;n'

    – Sundeep
    21 hours ago












  • also, the sed version above might be faster than the awk one for large files

    – Sundeep
    21 hours ago















3
















Like

If I have :



1st line (keep) 
2nd line (keep)
3rd line (keep)
4rth lines (delete)
5th (del)
6th (keep)
7nth (keep)
8th lines (keep)
9th (del)
10th (del)
11th (keep)
12th (keep)
13th (keep)
14th (del)
15th (del)


etc....










share|improve this question









New contributor




Jaguar Jom is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.















  • 1





    increment a line counter (zero-indexed) for each line read, print when (line counter modulo 5>=3)

    – ChuckCottrill
    yesterday











  • can you please clarify more,

    – Jaguar Jom
    yesterday






  • 3





    Possible duplicate of How to print lines number 15 and 25 out of each 50 lines?

    – Sundeep
    21 hours ago






  • 1





    the duplicate is slightly worded differently, but it is the same looked in a different way.. this question would be print lines 1,2,3 out of each 5 lines for ex: seq 15 | awk 'BEGIN a[1] a[2] a[3] ; NR % 5 in a' and seq 15 | sed -n 'p;n;p;n;p;n;n'

    – Sundeep
    21 hours ago












  • also, the sed version above might be faster than the awk one for large files

    – Sundeep
    21 hours ago













3












3








3


0







Like

If I have :



1st line (keep) 
2nd line (keep)
3rd line (keep)
4rth lines (delete)
5th (del)
6th (keep)
7nth (keep)
8th lines (keep)
9th (del)
10th (del)
11th (keep)
12th (keep)
13th (keep)
14th (del)
15th (del)


etc....










share|improve this question









New contributor




Jaguar Jom is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.













Like

If I have :



1st line (keep) 
2nd line (keep)
3rd line (keep)
4rth lines (delete)
5th (del)
6th (keep)
7nth (keep)
8th lines (keep)
9th (del)
10th (del)
11th (keep)
12th (keep)
13th (keep)
14th (del)
15th (del)


etc....







bash shell awk sed






share|improve this question









New contributor




Jaguar Jom is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











share|improve this question









New contributor




Jaguar Jom is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









share|improve this question




share|improve this question








edited 27 mins ago









Prvt_Yadv

3,00031328




3,00031328






New contributor




Jaguar Jom is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









asked yesterday









Jaguar JomJaguar Jom

161




161




New contributor




Jaguar Jom is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.





New contributor





Jaguar Jom is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






Jaguar Jom is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.







  • 1





    increment a line counter (zero-indexed) for each line read, print when (line counter modulo 5>=3)

    – ChuckCottrill
    yesterday











  • can you please clarify more,

    – Jaguar Jom
    yesterday






  • 3





    Possible duplicate of How to print lines number 15 and 25 out of each 50 lines?

    – Sundeep
    21 hours ago






  • 1





    the duplicate is slightly worded differently, but it is the same looked in a different way.. this question would be print lines 1,2,3 out of each 5 lines for ex: seq 15 | awk 'BEGIN a[1] a[2] a[3] ; NR % 5 in a' and seq 15 | sed -n 'p;n;p;n;p;n;n'

    – Sundeep
    21 hours ago












  • also, the sed version above might be faster than the awk one for large files

    – Sundeep
    21 hours ago












  • 1





    increment a line counter (zero-indexed) for each line read, print when (line counter modulo 5>=3)

    – ChuckCottrill
    yesterday











  • can you please clarify more,

    – Jaguar Jom
    yesterday






  • 3





    Possible duplicate of How to print lines number 15 and 25 out of each 50 lines?

    – Sundeep
    21 hours ago






  • 1





    the duplicate is slightly worded differently, but it is the same looked in a different way.. this question would be print lines 1,2,3 out of each 5 lines for ex: seq 15 | awk 'BEGIN a[1] a[2] a[3] ; NR % 5 in a' and seq 15 | sed -n 'p;n;p;n;p;n;n'

    – Sundeep
    21 hours ago












  • also, the sed version above might be faster than the awk one for large files

    – Sundeep
    21 hours ago







1




1





increment a line counter (zero-indexed) for each line read, print when (line counter modulo 5>=3)

– ChuckCottrill
yesterday





increment a line counter (zero-indexed) for each line read, print when (line counter modulo 5>=3)

– ChuckCottrill
yesterday













can you please clarify more,

– Jaguar Jom
yesterday





can you please clarify more,

– Jaguar Jom
yesterday




3




3





Possible duplicate of How to print lines number 15 and 25 out of each 50 lines?

– Sundeep
21 hours ago





Possible duplicate of How to print lines number 15 and 25 out of each 50 lines?

– Sundeep
21 hours ago




1




1





the duplicate is slightly worded differently, but it is the same looked in a different way.. this question would be print lines 1,2,3 out of each 5 lines for ex: seq 15 | awk 'BEGIN a[1] a[2] a[3] ; NR % 5 in a' and seq 15 | sed -n 'p;n;p;n;p;n;n'

– Sundeep
21 hours ago






the duplicate is slightly worded differently, but it is the same looked in a different way.. this question would be print lines 1,2,3 out of each 5 lines for ex: seq 15 | awk 'BEGIN a[1] a[2] a[3] ; NR % 5 in a' and seq 15 | sed -n 'p;n;p;n;p;n;n'

– Sundeep
21 hours ago














also, the sed version above might be faster than the awk one for large files

– Sundeep
21 hours ago





also, the sed version above might be faster than the awk one for large files

– Sundeep
21 hours ago










6 Answers
6






active

oldest

votes


















10














Try:



awk '(NR-1)%5<3' file


For example:



$ awk '(NR-1)%5<3' file
1st line (keep)
2nd line (keep)
3rd line (keep)
6th (keep)
7nth (keep)
8th lines (keep)
11th (keep)
12th (keep)
13th (keep)


How it works



The command (NR-1)%5<3 tells awk to print any line for which (NR-1)%5<3 is true. In awk, NR is the line number with the first line counting as 1. For every five lines in the file, that statement will be true for the first three.






share|improve this answer
































    4














    Basically, you want something like 'Fizz-Buzz' in awk ...



    awk ' if (i++%5 < 3) print $0;'


    To show this works...



    for x in 1 2 3 4 5 6 7 8 9 10 ; do echo $x; done |
    awk ' if (i++%5 < 3) print $0;'


    When your file is named, 'mybigfile.csv',



    awk ' if (i++%5 < 3) print $0;' < mybigfile.csv > mybigfile-123.csv





    share|improve this answer























    • You could use NR, or just rely on i defaulting to zero :-) (code golf)

      – ChuckCottrill
      yesterday


















    4














    A simple command is:



    awk 'if((NR-1) % 5<=2)print $0' file


    It will only print first 3 lines in sequence of 5 lines. Because (NR-1)%5 will give output like 0 1 2 3 4, and first 3 lines are less than equal to 2. So it will only print them.



    I have file with contents:



    1
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15


    The output is:



    1
    2
    3
    6
    7
    8
    11
    12
    13


    Or as suggested in comments you can use:



    awk '(NR - 1) % 5 <= 2' file





    share|improve this answer




















    • 2





      Or, with idiomatic use of awk syntax: awk '(NR - 1) % 5 <= 2' file

      – Kusalananda
      20 hours ago












    • Thanks I didnt know it.

      – Prvt_Yadv
      19 hours ago


















    4














    A generic solution for masking out a particular pattern of lines from a file:



    #!/bin/sh

    # The pattern is given on the command line.
    pattern=$1

    # The period is simply the length of the pattern.
    period=$#pattern

    # Use bc to convert the binary pattern to an integer.
    mask=$( printf 'ibase=2; %sn' "$pattern" | bc )

    awk -v mask="$mask" -v period="$period" '
    BEGIN p = lshift(1, period-1)
    and(rshift(p, (FNR-1) % period), mask)'


    This relies on awk implementing the non-standard functions and() (bitwise AND), rshift() and lshift() (bitwise right and left shift), which both GNU awk and some BSD implementations of awk does, but not mawk.



    This takes a pattern, which is a binary number representing both the cyclic period and what lines within each period should be kept or masked out. A 1 means "keep" and a 0 means "delete".



    For example: The pattern of line that should be applied in your question is 11100, which means "for each set of five lines, keep the first three and delete the others".



    Using 01001000 would delete all but the 2nd and 5th lines in every 8 lines.



    The awk program could also be written without the BEGIN block as



    and(lshift(1, (period-1) - (FNR-1) % period), mask)


    Left-shifting 1 by (period-1) - (FNR-1) % period positions is the same as calculating 2 to that power, but I'm using lshift() since awk does its arithmetics using floating point operations rather than in exact integer arithmetics.



    Since the code relies on the binary representation of the pattern, very long patterns may not work well.



    Testing:



    Removing the lines you want to remove:



    $ sh script.sh 11100 <file
    1st line (keep)
    2nd line (keep)
    3rd line (keep)
    6th (keep)
    7nth (keep)
    8th lines (keep)
    11th (keep)
    12th (keep)
    13th (keep)


    Inverting the pattern:



    $ sh script.sh 00011 <file
    4rth lines (delete)
    5th (del)
    9th (del)
    10th (del)
    14th (del)
    15th (del)





    share|improve this answer
































      4














      This can be solved using GNU sed:



      sed '4~5,5~5d' file


      Note that this uses a GNU-specific extension to the sed standard, and thus doesn't work with e.g. BSD sed on macOS. However, GNU sed can be installed on macOS using brew, after which it can be used as gsed. On Linux, GNU sed is the default.



      This prints every line that does not fall in the fourth till fifth line of every five lines; for a clearer example: sed '3~10,6~10d' fill select lines 1, 2, 7, 8, 9, 10 of every group of 10 lines by deleting lines 3 till 6.



      The top-voted answer suggests using awk '(NR-1)%5<3'. On my machine, on a file containing the numbers 1 till 2 million, this takes about 0.6 seconds, while the sed solution in this answer takes about 0.35 seconds. This is reasonable, since sed is in general a simpler tool, and can thus work faster than the more complicated, but more full-featured, awk.






      share|improve this answer








      New contributor




      tomsmeding is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.















      • 2





        +1 ... or 4~5N;d;

        – steeldriver
        13 hours ago


















      0














      Tried with below command and it worked fine



      for((i=1;i<=20;i++)); do j=$(($i+2)); sed -n ''$i','$j'p' filename;i=$(($j+2)); done


      output



      1st line (keep)
      2nd line (keep)
      3rd line (keep)
      6th (keep)
      7nth (keep)
      8th lines (keep)
      11th (keep)
      12th (keep)
      13th (keep)





      share|improve this answer


















      • 1





        That is nice, but you have know how many lines you have in advance, and you're looping back from the beginning each round. It cannot be used on a stream, and it gets more inefficient the bigger the data gets, so since OP says the number of lines is very large, this is not the best solution.

        – Law29
        17 hours ago











      Your Answer








      StackExchange.ready(function()
      var channelOptions =
      tags: "".split(" "),
      id: "106"
      ;
      initTagRenderer("".split(" "), "".split(" "), channelOptions);

      StackExchange.using("externalEditor", function()
      // Have to fire editor after snippets, if snippets enabled
      if (StackExchange.settings.snippets.snippetsEnabled)
      StackExchange.using("snippets", function()
      createEditor();
      );

      else
      createEditor();

      );

      function createEditor()
      StackExchange.prepareEditor(
      heartbeatType: 'answer',
      autoActivateHeartbeat: false,
      convertImagesToLinks: false,
      noModals: true,
      showLowRepImageUploadWarning: true,
      reputationToPostImages: null,
      bindNavPrevention: true,
      postfix: "",
      imageUploader:
      brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
      contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
      allowUrls: true
      ,
      onDemand: true,
      discardSelector: ".discard-answer"
      ,immediatelyShowMarkdownHelp:true
      );



      );






      Jaguar Jom is a new contributor. Be nice, and check out our Code of Conduct.









      draft saved

      draft discarded


















      StackExchange.ready(
      function ()
      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f509553%2fhow-to-delete-every-two-lines-after-3rd-lines-in-a-file-contains-very-large-numb%23new-answer', 'question_page');

      );

      Post as a guest















      Required, but never shown

























      6 Answers
      6






      active

      oldest

      votes








      6 Answers
      6






      active

      oldest

      votes









      active

      oldest

      votes






      active

      oldest

      votes









      10














      Try:



      awk '(NR-1)%5<3' file


      For example:



      $ awk '(NR-1)%5<3' file
      1st line (keep)
      2nd line (keep)
      3rd line (keep)
      6th (keep)
      7nth (keep)
      8th lines (keep)
      11th (keep)
      12th (keep)
      13th (keep)


      How it works



      The command (NR-1)%5<3 tells awk to print any line for which (NR-1)%5<3 is true. In awk, NR is the line number with the first line counting as 1. For every five lines in the file, that statement will be true for the first three.






      share|improve this answer





























        10














        Try:



        awk '(NR-1)%5<3' file


        For example:



        $ awk '(NR-1)%5<3' file
        1st line (keep)
        2nd line (keep)
        3rd line (keep)
        6th (keep)
        7nth (keep)
        8th lines (keep)
        11th (keep)
        12th (keep)
        13th (keep)


        How it works



        The command (NR-1)%5<3 tells awk to print any line for which (NR-1)%5<3 is true. In awk, NR is the line number with the first line counting as 1. For every five lines in the file, that statement will be true for the first three.






        share|improve this answer



























          10












          10








          10







          Try:



          awk '(NR-1)%5<3' file


          For example:



          $ awk '(NR-1)%5<3' file
          1st line (keep)
          2nd line (keep)
          3rd line (keep)
          6th (keep)
          7nth (keep)
          8th lines (keep)
          11th (keep)
          12th (keep)
          13th (keep)


          How it works



          The command (NR-1)%5<3 tells awk to print any line for which (NR-1)%5<3 is true. In awk, NR is the line number with the first line counting as 1. For every five lines in the file, that statement will be true for the first three.






          share|improve this answer















          Try:



          awk '(NR-1)%5<3' file


          For example:



          $ awk '(NR-1)%5<3' file
          1st line (keep)
          2nd line (keep)
          3rd line (keep)
          6th (keep)
          7nth (keep)
          8th lines (keep)
          11th (keep)
          12th (keep)
          13th (keep)


          How it works



          The command (NR-1)%5<3 tells awk to print any line for which (NR-1)%5<3 is true. In awk, NR is the line number with the first line counting as 1. For every five lines in the file, that statement will be true for the first three.







          share|improve this answer














          share|improve this answer



          share|improve this answer








          edited 20 hours ago









          Kusalananda

          138k17258428




          138k17258428










          answered yesterday









          John1024John1024

          48.1k5113128




          48.1k5113128























              4














              Basically, you want something like 'Fizz-Buzz' in awk ...



              awk ' if (i++%5 < 3) print $0;'


              To show this works...



              for x in 1 2 3 4 5 6 7 8 9 10 ; do echo $x; done |
              awk ' if (i++%5 < 3) print $0;'


              When your file is named, 'mybigfile.csv',



              awk ' if (i++%5 < 3) print $0;' < mybigfile.csv > mybigfile-123.csv





              share|improve this answer























              • You could use NR, or just rely on i defaulting to zero :-) (code golf)

                – ChuckCottrill
                yesterday















              4














              Basically, you want something like 'Fizz-Buzz' in awk ...



              awk ' if (i++%5 < 3) print $0;'


              To show this works...



              for x in 1 2 3 4 5 6 7 8 9 10 ; do echo $x; done |
              awk ' if (i++%5 < 3) print $0;'


              When your file is named, 'mybigfile.csv',



              awk ' if (i++%5 < 3) print $0;' < mybigfile.csv > mybigfile-123.csv





              share|improve this answer























              • You could use NR, or just rely on i defaulting to zero :-) (code golf)

                – ChuckCottrill
                yesterday













              4












              4








              4







              Basically, you want something like 'Fizz-Buzz' in awk ...



              awk ' if (i++%5 < 3) print $0;'


              To show this works...



              for x in 1 2 3 4 5 6 7 8 9 10 ; do echo $x; done |
              awk ' if (i++%5 < 3) print $0;'


              When your file is named, 'mybigfile.csv',



              awk ' if (i++%5 < 3) print $0;' < mybigfile.csv > mybigfile-123.csv





              share|improve this answer













              Basically, you want something like 'Fizz-Buzz' in awk ...



              awk ' if (i++%5 < 3) print $0;'


              To show this works...



              for x in 1 2 3 4 5 6 7 8 9 10 ; do echo $x; done |
              awk ' if (i++%5 < 3) print $0;'


              When your file is named, 'mybigfile.csv',



              awk ' if (i++%5 < 3) print $0;' < mybigfile.csv > mybigfile-123.csv






              share|improve this answer












              share|improve this answer



              share|improve this answer










              answered yesterday









              ChuckCottrillChuckCottrill

              722814




              722814












              • You could use NR, or just rely on i defaulting to zero :-) (code golf)

                – ChuckCottrill
                yesterday

















              • You could use NR, or just rely on i defaulting to zero :-) (code golf)

                – ChuckCottrill
                yesterday
















              You could use NR, or just rely on i defaulting to zero :-) (code golf)

              – ChuckCottrill
              yesterday





              You could use NR, or just rely on i defaulting to zero :-) (code golf)

              – ChuckCottrill
              yesterday











              4














              A simple command is:



              awk 'if((NR-1) % 5<=2)print $0' file


              It will only print first 3 lines in sequence of 5 lines. Because (NR-1)%5 will give output like 0 1 2 3 4, and first 3 lines are less than equal to 2. So it will only print them.



              I have file with contents:



              1
              2
              3
              4
              5
              6
              7
              8
              9
              10
              11
              12
              13
              14
              15


              The output is:



              1
              2
              3
              6
              7
              8
              11
              12
              13


              Or as suggested in comments you can use:



              awk '(NR - 1) % 5 <= 2' file





              share|improve this answer




















              • 2





                Or, with idiomatic use of awk syntax: awk '(NR - 1) % 5 <= 2' file

                – Kusalananda
                20 hours ago












              • Thanks I didnt know it.

                – Prvt_Yadv
                19 hours ago















              4














              A simple command is:



              awk 'if((NR-1) % 5<=2)print $0' file


              It will only print first 3 lines in sequence of 5 lines. Because (NR-1)%5 will give output like 0 1 2 3 4, and first 3 lines are less than equal to 2. So it will only print them.



              I have file with contents:



              1
              2
              3
              4
              5
              6
              7
              8
              9
              10
              11
              12
              13
              14
              15


              The output is:



              1
              2
              3
              6
              7
              8
              11
              12
              13


              Or as suggested in comments you can use:



              awk '(NR - 1) % 5 <= 2' file





              share|improve this answer




















              • 2





                Or, with idiomatic use of awk syntax: awk '(NR - 1) % 5 <= 2' file

                – Kusalananda
                20 hours ago












              • Thanks I didnt know it.

                – Prvt_Yadv
                19 hours ago













              4












              4








              4







              A simple command is:



              awk 'if((NR-1) % 5<=2)print $0' file


              It will only print first 3 lines in sequence of 5 lines. Because (NR-1)%5 will give output like 0 1 2 3 4, and first 3 lines are less than equal to 2. So it will only print them.



              I have file with contents:



              1
              2
              3
              4
              5
              6
              7
              8
              9
              10
              11
              12
              13
              14
              15


              The output is:



              1
              2
              3
              6
              7
              8
              11
              12
              13


              Or as suggested in comments you can use:



              awk '(NR - 1) % 5 <= 2' file





              share|improve this answer















              A simple command is:



              awk 'if((NR-1) % 5<=2)print $0' file


              It will only print first 3 lines in sequence of 5 lines. Because (NR-1)%5 will give output like 0 1 2 3 4, and first 3 lines are less than equal to 2. So it will only print them.



              I have file with contents:



              1
              2
              3
              4
              5
              6
              7
              8
              9
              10
              11
              12
              13
              14
              15


              The output is:



              1
              2
              3
              6
              7
              8
              11
              12
              13


              Or as suggested in comments you can use:



              awk '(NR - 1) % 5 <= 2' file






              share|improve this answer














              share|improve this answer



              share|improve this answer








              edited 19 hours ago

























              answered yesterday









              Prvt_YadvPrvt_Yadv

              3,00031328




              3,00031328







              • 2





                Or, with idiomatic use of awk syntax: awk '(NR - 1) % 5 <= 2' file

                – Kusalananda
                20 hours ago












              • Thanks I didnt know it.

                – Prvt_Yadv
                19 hours ago












              • 2





                Or, with idiomatic use of awk syntax: awk '(NR - 1) % 5 <= 2' file

                – Kusalananda
                20 hours ago












              • Thanks I didnt know it.

                – Prvt_Yadv
                19 hours ago







              2




              2





              Or, with idiomatic use of awk syntax: awk '(NR - 1) % 5 <= 2' file

              – Kusalananda
              20 hours ago






              Or, with idiomatic use of awk syntax: awk '(NR - 1) % 5 <= 2' file

              – Kusalananda
              20 hours ago














              Thanks I didnt know it.

              – Prvt_Yadv
              19 hours ago





              Thanks I didnt know it.

              – Prvt_Yadv
              19 hours ago











              4














              A generic solution for masking out a particular pattern of lines from a file:



              #!/bin/sh

              # The pattern is given on the command line.
              pattern=$1

              # The period is simply the length of the pattern.
              period=$#pattern

              # Use bc to convert the binary pattern to an integer.
              mask=$( printf 'ibase=2; %sn' "$pattern" | bc )

              awk -v mask="$mask" -v period="$period" '
              BEGIN p = lshift(1, period-1)
              and(rshift(p, (FNR-1) % period), mask)'


              This relies on awk implementing the non-standard functions and() (bitwise AND), rshift() and lshift() (bitwise right and left shift), which both GNU awk and some BSD implementations of awk does, but not mawk.



              This takes a pattern, which is a binary number representing both the cyclic period and what lines within each period should be kept or masked out. A 1 means "keep" and a 0 means "delete".



              For example: The pattern of line that should be applied in your question is 11100, which means "for each set of five lines, keep the first three and delete the others".



              Using 01001000 would delete all but the 2nd and 5th lines in every 8 lines.



              The awk program could also be written without the BEGIN block as



              and(lshift(1, (period-1) - (FNR-1) % period), mask)


              Left-shifting 1 by (period-1) - (FNR-1) % period positions is the same as calculating 2 to that power, but I'm using lshift() since awk does its arithmetics using floating point operations rather than in exact integer arithmetics.



              Since the code relies on the binary representation of the pattern, very long patterns may not work well.



              Testing:



              Removing the lines you want to remove:



              $ sh script.sh 11100 <file
              1st line (keep)
              2nd line (keep)
              3rd line (keep)
              6th (keep)
              7nth (keep)
              8th lines (keep)
              11th (keep)
              12th (keep)
              13th (keep)


              Inverting the pattern:



              $ sh script.sh 00011 <file
              4rth lines (delete)
              5th (del)
              9th (del)
              10th (del)
              14th (del)
              15th (del)





              share|improve this answer





























                4














                A generic solution for masking out a particular pattern of lines from a file:



                #!/bin/sh

                # The pattern is given on the command line.
                pattern=$1

                # The period is simply the length of the pattern.
                period=$#pattern

                # Use bc to convert the binary pattern to an integer.
                mask=$( printf 'ibase=2; %sn' "$pattern" | bc )

                awk -v mask="$mask" -v period="$period" '
                BEGIN p = lshift(1, period-1)
                and(rshift(p, (FNR-1) % period), mask)'


                This relies on awk implementing the non-standard functions and() (bitwise AND), rshift() and lshift() (bitwise right and left shift), which both GNU awk and some BSD implementations of awk does, but not mawk.



                This takes a pattern, which is a binary number representing both the cyclic period and what lines within each period should be kept or masked out. A 1 means "keep" and a 0 means "delete".



                For example: The pattern of line that should be applied in your question is 11100, which means "for each set of five lines, keep the first three and delete the others".



                Using 01001000 would delete all but the 2nd and 5th lines in every 8 lines.



                The awk program could also be written without the BEGIN block as



                and(lshift(1, (period-1) - (FNR-1) % period), mask)


                Left-shifting 1 by (period-1) - (FNR-1) % period positions is the same as calculating 2 to that power, but I'm using lshift() since awk does its arithmetics using floating point operations rather than in exact integer arithmetics.



                Since the code relies on the binary representation of the pattern, very long patterns may not work well.



                Testing:



                Removing the lines you want to remove:



                $ sh script.sh 11100 <file
                1st line (keep)
                2nd line (keep)
                3rd line (keep)
                6th (keep)
                7nth (keep)
                8th lines (keep)
                11th (keep)
                12th (keep)
                13th (keep)


                Inverting the pattern:



                $ sh script.sh 00011 <file
                4rth lines (delete)
                5th (del)
                9th (del)
                10th (del)
                14th (del)
                15th (del)





                share|improve this answer



























                  4












                  4








                  4







                  A generic solution for masking out a particular pattern of lines from a file:



                  #!/bin/sh

                  # The pattern is given on the command line.
                  pattern=$1

                  # The period is simply the length of the pattern.
                  period=$#pattern

                  # Use bc to convert the binary pattern to an integer.
                  mask=$( printf 'ibase=2; %sn' "$pattern" | bc )

                  awk -v mask="$mask" -v period="$period" '
                  BEGIN p = lshift(1, period-1)
                  and(rshift(p, (FNR-1) % period), mask)'


                  This relies on awk implementing the non-standard functions and() (bitwise AND), rshift() and lshift() (bitwise right and left shift), which both GNU awk and some BSD implementations of awk does, but not mawk.



                  This takes a pattern, which is a binary number representing both the cyclic period and what lines within each period should be kept or masked out. A 1 means "keep" and a 0 means "delete".



                  For example: The pattern of line that should be applied in your question is 11100, which means "for each set of five lines, keep the first three and delete the others".



                  Using 01001000 would delete all but the 2nd and 5th lines in every 8 lines.



                  The awk program could also be written without the BEGIN block as



                  and(lshift(1, (period-1) - (FNR-1) % period), mask)


                  Left-shifting 1 by (period-1) - (FNR-1) % period positions is the same as calculating 2 to that power, but I'm using lshift() since awk does its arithmetics using floating point operations rather than in exact integer arithmetics.



                  Since the code relies on the binary representation of the pattern, very long patterns may not work well.



                  Testing:



                  Removing the lines you want to remove:



                  $ sh script.sh 11100 <file
                  1st line (keep)
                  2nd line (keep)
                  3rd line (keep)
                  6th (keep)
                  7nth (keep)
                  8th lines (keep)
                  11th (keep)
                  12th (keep)
                  13th (keep)


                  Inverting the pattern:



                  $ sh script.sh 00011 <file
                  4rth lines (delete)
                  5th (del)
                  9th (del)
                  10th (del)
                  14th (del)
                  15th (del)





                  share|improve this answer















                  A generic solution for masking out a particular pattern of lines from a file:



                  #!/bin/sh

                  # The pattern is given on the command line.
                  pattern=$1

                  # The period is simply the length of the pattern.
                  period=$#pattern

                  # Use bc to convert the binary pattern to an integer.
                  mask=$( printf 'ibase=2; %sn' "$pattern" | bc )

                  awk -v mask="$mask" -v period="$period" '
                  BEGIN p = lshift(1, period-1)
                  and(rshift(p, (FNR-1) % period), mask)'


                  This relies on awk implementing the non-standard functions and() (bitwise AND), rshift() and lshift() (bitwise right and left shift), which both GNU awk and some BSD implementations of awk does, but not mawk.



                  This takes a pattern, which is a binary number representing both the cyclic period and what lines within each period should be kept or masked out. A 1 means "keep" and a 0 means "delete".



                  For example: The pattern of line that should be applied in your question is 11100, which means "for each set of five lines, keep the first three and delete the others".



                  Using 01001000 would delete all but the 2nd and 5th lines in every 8 lines.



                  The awk program could also be written without the BEGIN block as



                  and(lshift(1, (period-1) - (FNR-1) % period), mask)


                  Left-shifting 1 by (period-1) - (FNR-1) % period positions is the same as calculating 2 to that power, but I'm using lshift() since awk does its arithmetics using floating point operations rather than in exact integer arithmetics.



                  Since the code relies on the binary representation of the pattern, very long patterns may not work well.



                  Testing:



                  Removing the lines you want to remove:



                  $ sh script.sh 11100 <file
                  1st line (keep)
                  2nd line (keep)
                  3rd line (keep)
                  6th (keep)
                  7nth (keep)
                  8th lines (keep)
                  11th (keep)
                  12th (keep)
                  13th (keep)


                  Inverting the pattern:



                  $ sh script.sh 00011 <file
                  4rth lines (delete)
                  5th (del)
                  9th (del)
                  10th (del)
                  14th (del)
                  15th (del)






                  share|improve this answer














                  share|improve this answer



                  share|improve this answer








                  edited 15 hours ago

























                  answered 18 hours ago









                  KusalanandaKusalananda

                  138k17258428




                  138k17258428





















                      4














                      This can be solved using GNU sed:



                      sed '4~5,5~5d' file


                      Note that this uses a GNU-specific extension to the sed standard, and thus doesn't work with e.g. BSD sed on macOS. However, GNU sed can be installed on macOS using brew, after which it can be used as gsed. On Linux, GNU sed is the default.



                      This prints every line that does not fall in the fourth till fifth line of every five lines; for a clearer example: sed '3~10,6~10d' fill select lines 1, 2, 7, 8, 9, 10 of every group of 10 lines by deleting lines 3 till 6.



                      The top-voted answer suggests using awk '(NR-1)%5<3'. On my machine, on a file containing the numbers 1 till 2 million, this takes about 0.6 seconds, while the sed solution in this answer takes about 0.35 seconds. This is reasonable, since sed is in general a simpler tool, and can thus work faster than the more complicated, but more full-featured, awk.






                      share|improve this answer








                      New contributor




                      tomsmeding is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                      Check out our Code of Conduct.















                      • 2





                        +1 ... or 4~5N;d;

                        – steeldriver
                        13 hours ago















                      4














                      This can be solved using GNU sed:



                      sed '4~5,5~5d' file


                      Note that this uses a GNU-specific extension to the sed standard, and thus doesn't work with e.g. BSD sed on macOS. However, GNU sed can be installed on macOS using brew, after which it can be used as gsed. On Linux, GNU sed is the default.



                      This prints every line that does not fall in the fourth till fifth line of every five lines; for a clearer example: sed '3~10,6~10d' fill select lines 1, 2, 7, 8, 9, 10 of every group of 10 lines by deleting lines 3 till 6.



                      The top-voted answer suggests using awk '(NR-1)%5<3'. On my machine, on a file containing the numbers 1 till 2 million, this takes about 0.6 seconds, while the sed solution in this answer takes about 0.35 seconds. This is reasonable, since sed is in general a simpler tool, and can thus work faster than the more complicated, but more full-featured, awk.






                      share|improve this answer








                      New contributor




                      tomsmeding is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                      Check out our Code of Conduct.















                      • 2





                        +1 ... or 4~5N;d;

                        – steeldriver
                        13 hours ago













                      4












                      4








                      4







                      This can be solved using GNU sed:



                      sed '4~5,5~5d' file


                      Note that this uses a GNU-specific extension to the sed standard, and thus doesn't work with e.g. BSD sed on macOS. However, GNU sed can be installed on macOS using brew, after which it can be used as gsed. On Linux, GNU sed is the default.



                      This prints every line that does not fall in the fourth till fifth line of every five lines; for a clearer example: sed '3~10,6~10d' fill select lines 1, 2, 7, 8, 9, 10 of every group of 10 lines by deleting lines 3 till 6.



                      The top-voted answer suggests using awk '(NR-1)%5<3'. On my machine, on a file containing the numbers 1 till 2 million, this takes about 0.6 seconds, while the sed solution in this answer takes about 0.35 seconds. This is reasonable, since sed is in general a simpler tool, and can thus work faster than the more complicated, but more full-featured, awk.






                      share|improve this answer








                      New contributor




                      tomsmeding is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                      Check out our Code of Conduct.










                      This can be solved using GNU sed:



                      sed '4~5,5~5d' file


                      Note that this uses a GNU-specific extension to the sed standard, and thus doesn't work with e.g. BSD sed on macOS. However, GNU sed can be installed on macOS using brew, after which it can be used as gsed. On Linux, GNU sed is the default.



                      This prints every line that does not fall in the fourth till fifth line of every five lines; for a clearer example: sed '3~10,6~10d' fill select lines 1, 2, 7, 8, 9, 10 of every group of 10 lines by deleting lines 3 till 6.



                      The top-voted answer suggests using awk '(NR-1)%5<3'. On my machine, on a file containing the numbers 1 till 2 million, this takes about 0.6 seconds, while the sed solution in this answer takes about 0.35 seconds. This is reasonable, since sed is in general a simpler tool, and can thus work faster than the more complicated, but more full-featured, awk.







                      share|improve this answer








                      New contributor




                      tomsmeding is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                      Check out our Code of Conduct.









                      share|improve this answer



                      share|improve this answer






                      New contributor




                      tomsmeding is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                      Check out our Code of Conduct.









                      answered 15 hours ago









                      tomsmedingtomsmeding

                      1413




                      1413




                      New contributor




                      tomsmeding is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                      Check out our Code of Conduct.





                      New contributor





                      tomsmeding is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                      Check out our Code of Conduct.






                      tomsmeding is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                      Check out our Code of Conduct.







                      • 2





                        +1 ... or 4~5N;d;

                        – steeldriver
                        13 hours ago












                      • 2





                        +1 ... or 4~5N;d;

                        – steeldriver
                        13 hours ago







                      2




                      2





                      +1 ... or 4~5N;d;

                      – steeldriver
                      13 hours ago





                      +1 ... or 4~5N;d;

                      – steeldriver
                      13 hours ago











                      0














                      Tried with below command and it worked fine



                      for((i=1;i<=20;i++)); do j=$(($i+2)); sed -n ''$i','$j'p' filename;i=$(($j+2)); done


                      output



                      1st line (keep)
                      2nd line (keep)
                      3rd line (keep)
                      6th (keep)
                      7nth (keep)
                      8th lines (keep)
                      11th (keep)
                      12th (keep)
                      13th (keep)





                      share|improve this answer


















                      • 1





                        That is nice, but you have know how many lines you have in advance, and you're looping back from the beginning each round. It cannot be used on a stream, and it gets more inefficient the bigger the data gets, so since OP says the number of lines is very large, this is not the best solution.

                        – Law29
                        17 hours ago















                      0














                      Tried with below command and it worked fine



                      for((i=1;i<=20;i++)); do j=$(($i+2)); sed -n ''$i','$j'p' filename;i=$(($j+2)); done


                      output



                      1st line (keep)
                      2nd line (keep)
                      3rd line (keep)
                      6th (keep)
                      7nth (keep)
                      8th lines (keep)
                      11th (keep)
                      12th (keep)
                      13th (keep)





                      share|improve this answer


















                      • 1





                        That is nice, but you have know how many lines you have in advance, and you're looping back from the beginning each round. It cannot be used on a stream, and it gets more inefficient the bigger the data gets, so since OP says the number of lines is very large, this is not the best solution.

                        – Law29
                        17 hours ago













                      0












                      0








                      0







                      Tried with below command and it worked fine



                      for((i=1;i<=20;i++)); do j=$(($i+2)); sed -n ''$i','$j'p' filename;i=$(($j+2)); done


                      output



                      1st line (keep)
                      2nd line (keep)
                      3rd line (keep)
                      6th (keep)
                      7nth (keep)
                      8th lines (keep)
                      11th (keep)
                      12th (keep)
                      13th (keep)





                      share|improve this answer













                      Tried with below command and it worked fine



                      for((i=1;i<=20;i++)); do j=$(($i+2)); sed -n ''$i','$j'p' filename;i=$(($j+2)); done


                      output



                      1st line (keep)
                      2nd line (keep)
                      3rd line (keep)
                      6th (keep)
                      7nth (keep)
                      8th lines (keep)
                      11th (keep)
                      12th (keep)
                      13th (keep)






                      share|improve this answer












                      share|improve this answer



                      share|improve this answer










                      answered 21 hours ago









                      Praveen Kumar BSPraveen Kumar BS

                      1,6981311




                      1,6981311







                      • 1





                        That is nice, but you have know how many lines you have in advance, and you're looping back from the beginning each round. It cannot be used on a stream, and it gets more inefficient the bigger the data gets, so since OP says the number of lines is very large, this is not the best solution.

                        – Law29
                        17 hours ago












                      • 1





                        That is nice, but you have know how many lines you have in advance, and you're looping back from the beginning each round. It cannot be used on a stream, and it gets more inefficient the bigger the data gets, so since OP says the number of lines is very large, this is not the best solution.

                        – Law29
                        17 hours ago







                      1




                      1





                      That is nice, but you have know how many lines you have in advance, and you're looping back from the beginning each round. It cannot be used on a stream, and it gets more inefficient the bigger the data gets, so since OP says the number of lines is very large, this is not the best solution.

                      – Law29
                      17 hours ago





                      That is nice, but you have know how many lines you have in advance, and you're looping back from the beginning each round. It cannot be used on a stream, and it gets more inefficient the bigger the data gets, so since OP says the number of lines is very large, this is not the best solution.

                      – Law29
                      17 hours ago










                      Jaguar Jom is a new contributor. Be nice, and check out our Code of Conduct.









                      draft saved

                      draft discarded


















                      Jaguar Jom is a new contributor. Be nice, and check out our Code of Conduct.












                      Jaguar Jom is a new contributor. Be nice, and check out our Code of Conduct.











                      Jaguar Jom is a new contributor. Be nice, and check out our Code of Conduct.














                      Thanks for contributing an answer to Unix & Linux Stack Exchange!


                      • Please be sure to answer the question. Provide details and share your research!

                      But avoid


                      • Asking for help, clarification, or responding to other answers.

                      • Making statements based on opinion; back them up with references or personal experience.

                      To learn more, see our tips on writing great answers.




                      draft saved


                      draft discarded














                      StackExchange.ready(
                      function ()
                      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f509553%2fhow-to-delete-every-two-lines-after-3rd-lines-in-a-file-contains-very-large-numb%23new-answer', 'question_page');

                      );

                      Post as a guest















                      Required, but never shown





















































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown

































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown







                      Popular posts from this blog

                      Invision Community Contents History See also References External links Navigation menuProprietaryinvisioncommunity.comIPS Community ForumsIPS Community Forumsthis blog entry"License Changes, IP.Board 3.4, and the Future""Interview -- Matt Mecham of Ibforums""CEO Invision Power Board, Matt Mecham Is a Liar, Thief!"IPB License Explanation 1.3, 1.3.1, 2.0, and 2.1ArchivedSecurity Fixes, Updates And Enhancements For IPB 1.3.1Archived"New Demo Accounts - Invision Power Services"the original"New Default Skin"the original"Invision Power Board 3.0.0 and Applications Released"the original"Archived copy"the original"Perpetual licenses being done away with""Release Notes - Invision Power Services""Introducing: IPS Community Suite 4!"Invision Community Release Notes

                      Canceling a color specificationRandomly assigning color to Graphics3D objects?Default color for Filling in Mathematica 9Coloring specific elements of sets with a prime modified order in an array plotHow to pick a color differing significantly from the colors already in a given color list?Detection of the text colorColor numbers based on their valueCan color schemes for use with ColorData include opacity specification?My dynamic color schemes

                      Tom Holland Mục lục Đầu đời và giáo dục | Sự nghiệp | Cuộc sống cá nhân | Phim tham gia | Giải thưởng và đề cử | Chú thích | Liên kết ngoài | Trình đơn chuyển hướngProfile“Person Details for Thomas Stanley Holland, "England and Wales Birth Registration Index, 1837-2008" — FamilySearch.org”"Meet Tom Holland... the 16-year-old star of The Impossible""Schoolboy actor Tom Holland finds himself in Oscar contention for role in tsunami drama"“Naomi Watts on the Prince William and Harry's reaction to her film about the late Princess Diana”lưu trữ"Holland and Pflueger Are West End's Two New 'Billy Elliots'""I'm so envious of my son, the movie star! British writer Dominic Holland's spent 20 years trying to crack Hollywood - but he's been beaten to it by a very unlikely rival"“Richard and Margaret Povey of Jersey, Channel Islands, UK: Information about Thomas Stanley Holland”"Tom Holland to play Billy Elliot""New Billy Elliot leaving the garage"Billy Elliot the Musical - Tom Holland - Billy"A Tale of four Billys: Tom Holland""The Feel Good Factor""Thames Christian College schoolboys join Myleene Klass for The Feelgood Factor""Government launches £600,000 arts bursaries pilot""BILLY's Chapman, Holland, Gardner & Jackson-Keen Visit Prime Minister""Elton John 'blown away' by Billy Elliot fifth birthday" (video with John's interview and fragments of Holland's performance)"First News interviews Arrietty's Tom Holland"“33rd Critics' Circle Film Awards winners”“National Board of Review Current Awards”Bản gốc"Ron Howard Whaling Tale 'In The Heart Of The Sea' Casts Tom Holland"“'Spider-Man' Finds Tom Holland to Star as New Web-Slinger”lưu trữ“Captain America: Civil War (2016)”“Film Review: ‘Captain America: Civil War’”lưu trữ“‘Captain America: Civil War’ review: Choose your own avenger”lưu trữ“The Lost City of Z reviews”“Sony Pictures and Marvel Studios Find Their 'Spider-Man' Star and Director”“‘Mary Magdalene’, ‘Current War’ & ‘Wind River’ Get 2017 Release Dates From Weinstein”“Lionsgate Unleashing Daisy Ridley & Tom Holland Starrer ‘Chaos Walking’ In Cannes”“PTA's 'Master' Leads Chicago Film Critics Nominations, UPDATED: Houston and Indiana Critics Nominations”“Nominaciones Goya 2013 Telecinco Cinema – ENG”“Jameson Empire Film Awards: Martin Freeman wins best actor for performance in The Hobbit”“34th Annual Young Artist Awards”Bản gốc“Teen Choice Awards 2016—Captain America: Civil War Leads Second Wave of Nominations”“BAFTA Film Award Nominations: ‘La La Land’ Leads Race”“Saturn Awards Nominations 2017: 'Rogue One,' 'Walking Dead' Lead”Tom HollandTom HollandTom HollandTom Hollandmedia.gettyimages.comWorldCat Identities300279794no20130442900000 0004 0355 42791085670554170004732cb16706349t(data)XX5557367