Re: locating duplicate records in a 300,000 record file by Shane
Shane
Sat Mar 15 10:03:34 CDT 2008
This is a multi-part message in MIME format.
------=_NextPart_000_0072_01C88673.1109E000
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
Hi,
Since you are using 2007, if you want to remove the duplicates you can =
choose the command Data, Remove Duplicates. As already mentioned you =
can use conditional formatting to mark the duplicates. But with 300000 =
you may want to locate the found records quickly, so:
1. After applying the conditional format mentioned in a previous reply
2. Select the 300000 rows and choose the command Home, Sort & Filter, =
Custom Filter
3. In the Sort box under Sort by pick the column you want to sort on, =
under Sort on pick Cell Color, under Order select No Cell Color and On =
Bottom, then click OK.
This will put all the duplicates at the top.
Cheers,
Shane Devenshire
Microsoft Excel MVP
"robro" <rrothberg@gmail.com> wrote in message =
news:ac0e781d-8b81-4dd1-bcec-53de5a88cbe9@i7g2000prf.googlegroups.com...
> I previously had help with writing a macro to eliminate duplicateh
> records, and it worked great which told me there are two duplicate
> records. Now I need to quickly identify them.
>=20
> There is one column of numbers, sorted in numeric order. I just need
> to place an X
> in a helper column for each duplicate record so I can then sort by the
> helper column. Basically for this to be fast, I just want to compare
> one cell with the one above it. I've yet to figure out how to do
> that. Any help appreciated! I've googled up a storm and all I'm
> finding is deleting dupes or placing an X in a helper column but
> comparing all records in column A which will take too long with
> 300,000 records.
>=20
> Thanks!
------=_NextPart_000_0072_01C88673.1109E000
Content-Type: text/html;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=3DContent-Type content=3D"text/html; =
charset=3Dunicode">
<META content=3D"MSHTML 6.00.6000.16609" name=3DGENERATOR></HEAD>
<BODY id=3DMailContainerBody=20
style=3D"PADDING-RIGHT: 10px; PADDING-LEFT: 10px; PADDING-TOP: 15px"=20
bgColor=3D#ffffff leftMargin=3D0 topMargin=3D0 CanvasTabStop=3D"true"=20
name=3D"Compose message area">
<DIV><FONT face=3DArial size=3D2>Hi,</FONT></DIV>
<DIV><FONT face=3DArial size=3D2></FONT> </DIV>
<DIV><FONT face=3DArial size=3D2>Since you are using 2007, if you want =
to remove the=20
duplicates you can choose the command Data, Remove Duplicates. As =
already=20
mentioned you can use conditional formatting to mark the =
duplicates. But=20
with 300000 you may want to locate the found records quickly, =
so:</FONT></DIV>
<DIV><FONT face=3DArial size=3D2></FONT> </DIV>
<DIV><FONT face=3DArial size=3D2>1. After applying the conditional =
format mentioned=20
in a previous reply</FONT></DIV>
<DIV><FONT face=3DArial size=3D2>2. Select the 300000 rows and choose =
the command=20
Home, Sort & Filter, Custom Filter</FONT></DIV>
<DIV><FONT face=3DArial size=3D2>3. In the Sort box under Sort by pick =
the column=20
you want to sort on, under Sort on pick Cell Color, under Order select =
No Cell=20
Color and On Bottom, then click OK.</FONT></DIV>
<DIV><FONT face=3DArial size=3D2></FONT> </DIV>
<DIV><FONT face=3DArial size=3D2>This will put all the duplicates at the =
top.</FONT></DIV>
<DIV><FONT face=3DArial size=3D2></FONT> </DIV>
<DIV><FONT face=3DArial size=3D2>Cheers,</FONT></DIV>
<DIV><FONT face=3DArial size=3D2>Shane Devenshire</FONT></DIV>
<DIV><FONT face=3DArial size=3D2>Microsoft Excel MVP</FONT></DIV>
<DIV><BR><BR>"robro" <rrothberg@gmail.com> wrote in message=20
news:ac0e781d-8b81-4dd1-bcec-53de5a88cbe9@i7g2000prf.googlegroups.com...<=
BR>>=20
I previously had help with writing a macro to eliminate =
duplicateh<BR>>=20
records, and it worked great which told me there are two =
duplicate<BR>>=20
records. Now I need to quickly identify them.<BR>> <BR>> =
There is=20
one column of numbers, sorted in numeric order. I just =
need<BR>> to=20
place an X<BR>> in a helper column for each duplicate record so I can =
then=20
sort by the<BR>> helper column. Basically for this to be fast, =
I just=20
want to compare<BR>> one cell with the one above it. I've yet =
to figure=20
out how to do<BR>> that. Any help appreciated! I've =
googled up a=20
storm and all I'm<BR>> finding is deleting dupes or placing an X in a =
helper=20
column but<BR>> comparing all records in column A which will take too =
long=20
with<BR>> 300,000 records.<BR>> <BR>> =
Thanks!</DIV></BODY></HTML>
------=_NextPart_000_0072_01C88673.1109E000--