• Buddahriffic@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    5 days ago

    It’s about 300 samples for an estimate of the distribution with a 95% confidence iirc. That’s assuming the samples are representative (unbiased) and 95% confidence doesn’t mean it’s within 95% of reality, but that 5% of tests run in such a way would be expected to be inaccurate (and there’s no way of knowing for sure which one this particular sample is because even a meta study will have such an error rate, though you can increase the confidence with more samples or studies, just never to 100% unless you study every possible sample, including future ones).

      • Buddahriffic@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 day ago

        Then any statistics you measure on that population might be fully accurate for those 100 but might be less able to predict what the next 100 will look like.

        You can still measure stats with smaller groups, it just means the confidence interval is smaller. With 300, there’s a 95% chance your test results are close to reality. With 100 it might be more like 66%.

        • TheBlackLounge@lemmy.zip
          link
          fedilink
          English
          arrow-up
          2
          ·
          1 day ago

          Population is a statistical term which means “everything”. There is no “next 100”.

          The 300 number is specifically about very big populations where you’re trying to measure something like an average of an unknown variable. It doesn’t apply to just anything statistics.

          • Buddahriffic@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            ·
            24 hours ago

            I meant like births, as in even if you can enumerate every single individual, statistics can apply to future members that don’t yet exist.

            And yeah, it’s been a while and I remembered that the proof didn’t depend on the population size but forgot that it assumed a large population size in the first place. I was wrong.