Welcome to Law of Large Numbers. The larger the number of test performed, the closer to the probable average you get.



Simply put, 4 is a pretty bad sample size.