Suggest a change

These fields are all optional and need only
be supplied if you would like a direct reply.

Subject
Your email address
Your real name

You must answer this!

If you don't, my spam filtering will
ensure that I never see your email.

What's 8 plus five (in digits only)?

Please make your changes here and then
Editing tips and layout rules.

File: FindingPerrinPseudoPrimes_Part1

''' <link rel="alternate" type="application/rss+xml"
'''       href="/rss.xml" title="RSS Feed">

********> width="25%"
|>>
''' <a title="Subscribe to my feed"
'''    rel="alternate"
'''    href="https://www.solipsys.co.uk/rss.xml">
'''    <img style="border-width: 0px;"
'''         src="https://www.feedburner.com/fb/images/pub/feed-icon32x32.png"
'''         align="middle"
'''         alt="" />Subscribe!</a>
_
''' <a href="https://twitter.com/ColinTheMathmo">
''' <img src="https://www.solipsys.co.uk/new/images/TwitterButton.png"
'''      title="By: TwitterButtons.net"
'''      width="212" height="69"
'''      alt="@ColinTheMathmo"
'''     /></a>
<<|
----
My latest posts can be
found here:
  * ColinsBlog
----
Previous blog posts:
  * TheUnwiseUpdate
  * MilesPerGallon
  * TrackingAnItemOnHackerNews
  * HackerNewsUserAges
  * PokingTheDustyCorners
  * ThereIsNoTimeForThis
  * PublicallySharingLinks
  * LearningTimesTables
  * GracefulDegradation
  * DiagrammingMathsTopics
  * OnTheRack
  * SquareRootByLongDivision
  * BeyondTheBoundary
  * FillInTheGaps
  * SoftwareChecklist
  * NASASpaceCrews
  * TheBirthdayParadox
  * TheTrapeziumConundrum
  * RevisitingTheAnt
  * TheAntAndTheRubberBand
  * IrrationalsExist
  * MultipleChoiceProbabilityPuzzle
  * RandomEratosthenes
  * WrappingUpSquareDissection
  * DissectingASquarePart2
  * DissectingACircle
  * DissectingASquare
  * AnOddityInTennis
  * DecisionTreeForTennis
  * DecisionTreesInGames
  * AMatterOfConvention
  * DoYouNourishOrTarnish
  * BinarySearchReconsidered
  * TwoEqualsFour
  * TheLostPropertyOffice
  * TheForgivingUserInterface
  * SettingUpRSS
  * WithdrawingFromHackerNews
----
Additionally, some earlier writings:
  * RandomWritings.
  * ColinsBlog2010
  * ColinsBlog2009
  * ColinsBlog2008
  * ColinsBlog2007
  * ColinsBlogBefore2007
''' <img src="/cgi-bin/CountHits.py?FPPP_1" alt="" />
********
!!! Finding Perrin Pseudo-primes: _ Part 1

Some 20 years ago I was chatting with a friend of mine ( AndrewLipson )
and he said something like the following:

[[[> This isn't an exact _ quotation, but it's _ close enough. ]]]
  * "A chap at work who does maths, but is not actually
     a mathematician, asked me the following. _ _ Consider
     the sequence $k(n)$ with
       * $k(1)=0,\;k(2)=2,\;k(3)=3,$ and
       * $k(n)=k(n-2)+k(n-3).$
     Why is it true that $n$ divides $k(n)$ if and only if $n$ is prime?"

My immediate response was "Well, it's not true." and Andrew's reply
was swift and to the point:

|>> [[[ |>> OK then: Find a _ counter-example. <<| ]]] <<|

So I started on that.  I figured Andrew had already done the basic
work of looking for a small counter-example, but I also figured that
I would need to get warmed up and start to get a feel for the problem.

Before that, I need to put this into some historical context.  This
was when, if you wanted to do arithmetic on more than 32 bit numbers,
you had to write your own arbitrary arithmetic package.  It was before
the WWW as we know it, although not pre-internet /per/se./  So some of
what follows will seem unnecessary.  If you think that, I invite you
to find what is common for the first 1000 counter-examples, except for
a small number of them, and tell me which ones do not follow the pattern.

And so ...

[[[>
| /n/ | /k(n)/ | Prime | /Divides/ | Confirmed |
| |>> 2 <<| | |>> 2 <<| | |>> True <<| | |>> True <<| | |>> Yes <<| |
| |>> 3 <<| | |>> 3 <<| | |>> True <<| | |>> True <<| | |>> Yes <<| |
| |>> 4 <<| | |>> 2 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 5 <<| | |>> 5 <<| | |>> True <<| | |>> True <<| | |>> Yes <<| |
| |>> 6 <<| | |>> 5 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 7 <<| | |>> 7 <<| | |>> True <<| | |>> True <<| | |>> Yes <<| |
| |>> 8 <<| | |>> 10 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 9 <<| | |>> 12 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 10 <<| | |>> 17 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 11 <<| | |>> 22 <<| | |>> True <<| | |>> True <<| | |>> Yes <<| |
| |>> 12 <<| | |>> 29 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 13 <<| | |>> 39 <<| | |>> True <<| | |>> True <<| | |>> Yes <<| |
| |>> 14 <<| | |>> 51 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 15 <<| | |>> 68 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 16 <<| | |>> 90 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 17 <<| | |>> 119 <<| | |>> True <<| | |>> True <<| | |>> Yes <<| |
| |>> 18 <<| | |>> 158 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 19 <<| | |>> 209 <<| | |>> True <<| | |>> True <<| | |>> Yes <<| |
| |>> 20 <<| | |>> 277 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 21 <<| | |>> 367 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 22 <<| | |>> 486 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 23 <<| | |>> 644 <<| | |>> True <<| | |>> True <<| | |>> Yes <<| |
| |>> 24 <<| | |>> 853 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |
| |>> 25 <<| | |>> 1130 <<| | |>> False <<| | |>> False <<| | |>> Yes <<| |

|>> /and/so/on/.../ <<|
]]]
OK, so working it by hand we can see that it's true up to 25, but we
can also see that /k(n)/ is starting to grow quite rapidly.  Some
messing about by hand shows that it's going to be exponential, just
as the Fibonacci sequence is exponential.  Not a surprise, really,
since this is very Fibonacci-like, with each term defined by adding
two previous terms.

We can work out the ratio between successive terms and see that it
starts to converge quite rapidly to about 1.324736...  In fact we
can show that:

$$k(n)=\alpha^n+\beta^n+\gamma^n$$

where $\alpha,$ $\beta,$ and $\gamma$ are solutions to the
equation:

$$x^3=x+1$$

(Usually we need constants in front of the powers of the roots,
but in this case the constants all turn out to be 1).

[[[ [[[> Skip this _ if you _ want to. ]]]
That's not magic, that's called the "characteristic equation" and
you can see that the coefficients are related to the definition.

No?  Can't see that?  Well the definition is:

$$k(n)=a.k(n-1)+b.k(n-2)+c.k(n-3)$$

where /a=0,/ /b=1,/ and /c=1./  Then the characteristic equation
is $x^3=a.x^2+b.x+c.$

Anyway ...
]]]

The roots to that cubic are:

* $\alpha\approx{}1.324736\ldots$
  * $\beta\approx{}-0.66236-0.56228{}i$
  * $\gamma\approx{}-0.66236+0.56228{}i$

These two last have absolute value less than 1, so as /n/ gets bigger,
$\beta^n$ and $\gamma^n$ approach 0.  Since
$k(n)=\alpha^n+\beta^n+\gamma^n,$ with $\beta^n$ and $\gamma^n$
disappearing, so $k(n)\rightarrow\alpha^n$ as /n/ gets larger.

Now the number of digits in a number /m/ (base 10) is given by /log(m)/
(base 10), so that means the number of digits in /k(n)/ is /log(k(n))/
which is roughly $\log(\alpha^n),$ which in turn is $n.\log(\alpha).$

Now $\log(\alpha)$ is about 0.122..., so by the time /n/ is 100,
/k(n)/ will have 12 digits (actually 13, because we should round up),
and it will only get worse.

So we can't go far with this by hand.  But that's OK, because what
we want to know is this:

* When /n/ is prime,
    * does /n/ divide /k(n)/ ?

|>> [[[ |>>
The sequence is called _
the Perrin Sequence, _
so I call this the _
"Perrin Test."
<<| ]]] <<|

[[[>
{{{
#!/usr/bin/python

from time import clock

for limit_power in [0, 1, 2, 3, 4]:

n = 2
    limit = 4**limit_power
    t_start = clock()

while clock()-t_start < limit:

a,b,c = 3,0,2
        for i in xrange(n):
            a,b,c = b,c,(a+b) % n

n += 1

print limit, n, float(n)/2**limit_power
}}}
]]]
If we compute /k(n)/ modulo /n/ then it will divide if the answer
is 0, and because we're working modulo /n/ the numbers in the
intermediate calculation don't grow too quickly.  The only problem
then is that after each /n,/ we have to start all over again for
the next one.  That means that the time taken grows quadratically,
and that will kill us fairly quickly - we won't get that far.

So here we have some Python code to compute /k(n)/mod/n/ for as long
as allowed by some time limit, and then we print how far we got. We're
not actually doing any of the searching for a counter-example in this
code, since we're not actually seeing whether the numbers are predicted
to be prime, nor are we seeing if they actually are prime. This is just
showing that it's going to be a slow process if we use this method.

[[[<
|     Secs    | |>>  *n*  <<| |   n/sqrt(t)  |
| |>>   1 <<| | |>>  3133 <<| | |>> 3133.0 <<| |
| |>>   4 <<| | |>>  6229 <<| | |>> 3114.5 <<| |
| |>>  16 <<| | |>> 12562 <<| | |>> 3140.5 <<| |
| |>>  64 <<| | |>> 25179 <<| | |>> 3147.4 <<| |
| |>> 256 <<| | |>> 50642 <<| | |>> 3165.1 <<| |
]]]
The code multiplies the time limit by 4 each time, and we
can see that we get twice as far.  That means that to get twice
as far takes four times as long, so, as expected, the time taken
grows quadratically.  The timings on my machine are shown at left.

[[[>
{{{
#!/usr/bin/python

from time import clock
from sys import argv

def is_prime_q(n):

if n in [2, 3, 5, 7]: return True
    if n < 11           : return False

if n % 2 == 0       : return False
    if pow(2,n-1,n) != 1: return False
    if n % 3 == 0       : return False
    if pow(3,n-1,n) != 1: return False
    if n % 5 == 0       : return False
    if pow(5,n-1,n) != 1: return False

d = 7
    while d*d <= n:
        if (n % d) == 0: return False
        d += 2

return True

def is_perrin_q(n):

if n in [2, 3]: return True

a0, a1, a2 = 3, 0, 2

i = n
    while i > 0:

a0, a1, a2 = a1, a2, (a0+a1) % n
        i -= 1

return a0 == 0

# ======================================

n = 2
t = 1

while clock() < int(argv[1]):

is_prime = is_prime_q(n)
    is_perrin = is_perrin_q(n)
    if is_prime != is_perrin:
        print n, is_prime, is_perrin

if t < clock():
        print 'Tested to %d in %d secs' % (n,t)
        t += 1

n += 1
}}}
]]]
As you can see, it takes 4 minutes to get to 50 thousand, so that's
not getting much done.  We can derive a formula for how far we get
in a given time:

$$n\;\approx\;3150\sqrt(t)$$

So here we have the code to start the testing process.  We have a
simple test for primality, first testing for being a small prime,
then using Fermat's Little Theorem, and finally performing trial
division up to the square root of the number being tested.

Then we have the routine to test the Perrin condition.  We start
with the first few terms in the Perrin sequence, then advance up
the sequence, working modulo /n/ to avoid having the numbers get
stupidly large.  In the end, we return /True/ if the relevant term
in the sequence is zero, indicating that /n/ divides /k(n)./

The main loop starts at /n=2/ and then tests to see if the number
is prime, and whether it passes the Perrin test.  If these answers
don't match, then the number is printed, and is a counter-example.

Specific measurements give:

* Roughly 104 per second when n=100k
  * Roughly 42 per second when n=250k
  * Roughly 10 per second when n=1000k

We also print every second how far we've got so far, so we can
measure how fast the code runs, and then see how we can improve it.
Running it for 1000 seconds and fitting an equation, we get an
approximate formula for its speed:

$$n\;\approx\;4575\sqrt{t}$$

This is all very rough and ready, and will fall apart as we go
significantly further, but it's enough to get an idea of what's
going on.  We can see that getting to a million will take about
13 hours, so we really, really want to speed this up.

So what do we do?  How can we get further, faster?

In Part 2 we'll look at algorithmic improvements.
----

|>>
| |>> <<<< Prev <<<< ---- TheUnwiseUpdate <<| | : | |>> >>>> Next >>>> ---- FindingPerrinPseudoPrimes_Part2 ... <<| |
----
********>
''' <a href="https://twitter.com/ColinTheMathmo">You should follow me on twitter</a>
********
''' <a href="https://twitter.com/ColinTheMathmo">
''' <img src="https://www.solipsys.co.uk/new/images/TwitterButton.png"
'''      title="By: TwitterButtons.net"
'''      width="212" height="69"
'''      alt="@ColinTheMathmo"
'''     /></a>
********<
<<|
----
!! Comments

I've decided no longer to include comments directly via the Disqus
(or any other) system.  Instead, I'd be more than delighted to get
emails from people who wish to make comments or engage in discussion.
Comments will then be integrated into the page as and when they are
appropriate.

If the number of emails/comments gets too large to handle then I
might return to a semi-automated system.  That's looking increasingly
unlikely.

********<