Paul, you are a smart hacker, DPI can not stop you,:-). DPI hardly can tackle encrypted communication now, but this do not prevent DPI from getting more and more market deployment. Maybe someday,most internet communications will be encrypted to circumvent such kind of inspection, then I think new techonolgies will be devised to face these embarrassment. Anyway, I think content protection like copyright protection is a interesting area that worth doing something. ****************************************************************************************** This email and its attachments contain confidential information from HUAWEI, which is intended only for the person or entity whose address is listed above. Any use of the information contained here in any way (including, but not limited to, total or partial disclosure, reproduction, or dissemination) by persons other than the intended recipient(s) is prohibited. If you receive this email in error, please notify the sender by phone or email immediately and delete it! ***************************************************************************************** ----- 原邮件 ----- 发件人: "Paul E. Jones" <paulej@packetizer.com> 日期: 星期五, 九月 26日, 2008 下午2:06 主题: RE: RE: [itu-sg16] Q22 Question - SATC 收件人: 'zourong 52447' <zou.rong@huawei.com> 抄送: ''Noah Luo ' ("罗忠")' <noah@huawei.com>
Rong,
You are right that current file sharing networks (mostly) do not use encryption. But, they would if necessary. In fact, I have a file sharingprogram on my machine that does offer encryption for file transfers. If such measures were employed to block traffic, I think all tools would simply encrypt contents by default. One could try to guess the contents of the encrypted file by looking at such things as the file length, etc., but nothing is perfect.
I can accept that no solution is perfect, but I honestly believe that this one just will not work in practice. I used to be one of the kids in high school that would remove copy protection from software products by manuallyaltering machine code right on the disk. I even wrote my own utilities to allow me to modify code on disk -- and I wrote that in assembly language. Lawful intercept works for a few reasons: 1) Most criminals are stupid and use public communication 2) Almost everybody connects to a carrier network to carry voice -- so carriers become a natural intercept point
Lawful intercept does not work for smart criminals. They use secure communication channels.
But, we can't really compare this to Lawful Intercept. Normal users just use the tools that are out on the Internet. If I were writing a file sharing program and encountered packet filtering issues, I would immediatelyemploy encryption. I could write the necessary code in an afternoon and effectively defeat systems employed to stop it. And, those systems are all very expensive hardware-based solutions that would be rendered useless. Honestly, I'm not trying to be an irritant :-) But, I'm quite doubtful that any DPI scheme can work to stop the proliferation of copyrighted material.It's simply too easy to devise a work-around in software and, while the DPI logic can get more complicated, the algorithms in a PC-based application can be revised and updated far more rapidly. Heck, I would even have the packet encoding algorithms written in such a way as to be downloadable when the program starts. That way, the newest encoding algorithms are always available every time the program starts.
Paul
-----Original Message----- From: zourong 52447 [mailto:zou.rong@huawei.com] Sent: Thursday, September 25, 2008 11:48 PM To: Paul E. Jones Cc: 'Noah Luo ' ("罗忠") Subject: 回复:RE: [itu-sg16] Q22 Question - SATC
Paul,
Sorry for taking so long to respond to your mail because I have taken a long vacation and then followed by a busy business trip.
I agree with you that the method proposed in my contribution AVD 3541> is not a perfect solution to the copy-right protection issue. It could not handle the case that the communication is encrypted ?C as you mentioned. In fact, I think it is one of the limitations of DPI technology. The signature-based DPI technology could not inspect the encrypted data flow. But I still believe that it could work and do something useful in a broad range of use cases, just like the lawful interception: it hardly could handle the encrypted communication but it still can work> well.
As to your suspicion whether it could work in a practical situation.> I hope it could work in some public file sharing scenarios especially the P2P file sharing. Why I put my effort in study the P2P file sharing> scenario is that P2P file sharing is the most popular file sharing tools nowadays and most of the files being shared are multimedia files> and many of them are pirated.
P2P file sharing has one important character that could be exploited:> the file information is almost public. It should let people know what the resource is, for example, the file name, file type, file size, file hash value etc., these file describe information will be transmitted in the file sharing control messages and could be intercepted by SATC.
Another character of P2P file sharing is that it seldom uses encrypted> communication because that will bring burden to its efficiency and affect its popularity.
Then to the question of “good database” and “suspicious database”. What I want to do is a multi-level inspection and filter. It maybe works as following:
Extracting_file_information(); Searching_good_database(); If(got matched item) { Compare_signature(); If(match) { It is copy-righted file and transmitted; } Else { Searching_suspecious_database(); If(match) { It is a known pirated version, stop it; } Else { It maybe a new pirated version, record it temporality and notify the content provider to make sure and to update the suspicious database ; Transmit_flow(); } } }
In this multi-level filtering, the first level ”good database” filtering can be performed with the help of a reasonably small- sized> database and achieve rather fast processing speed, the second level ”suspicious database” filtering may require a much bigger database. But only searching only a part of them each time also could yield reasonable processing speed.
So that is what I want to do, only those truly pirated file sharing will be stopped right now, other suspected file sharing will be recorded and let the technicians to do further judgment.
This email and its attachments contain confidential information from> HUAWEI, which is intended only for the person or entity whose address is listed above. Any use of the information contained here in any way (including, but not limited to, total or partial disclosure, reproduction, or dissemination) by persons other than the intended recipient(s) is prohibited. If you receive this email in error,
***********************************************************************> ******************* please> notify the sender by phone or email
immediately and delete it!
***********************************************************************> ******************
----- 原邮件 ----- 发件人: "Paul E. Jones" <paulej@packetizer.com> 日期: 星期五, 九月 12日, 2008 上午11:50 主题: RE: [itu-sg16] Q22 Question - SATC 收件人: 'Noah Luo(罗忠)' <noah@huawei.com> 抄送: zou.rong@huawei.com
Noah,
I have no objection to exploring the idea. But, I'm 100% certain it will not work :-) I am various serious about that. When I was a teenager, I spent a lot of time removing copy protection from software
(actually> > modifying the machine code by hand), hacking computer systems,> > etc. Even
when I was in college, I had a professor who called me an unscrupulous hacker. He was only half joking. He had a real admiration for my technicalability, but wasn't very happy with some of the hacking I used to do. Still, he recognized that I was a good kid who just wanting to explore and learn as much as possible.
I used to play with viruses. I would even write some that would not cause harm -- they would never replicate. Rather, they would just run in the background of machines that I placed them on, loading into the system at boot time. I still have a collection of probably 1,000 virus files, many with the assembly code.
The short of it is that I have a lot of experience at this sort of thing.While I don't do the hacking I did when I was a kid, I can guarantee you that there will be some just ready to tackle the challenge of breaking any kind of defense like what this standard intends to provide. And, as I said, the simplest thing to do is simply use TLS. In fact, I could exchange pirated music or video via e-mail between my server at home and anybody else who uses TLS -- my servers all use encryption. But, we can certainly employ simpler techniques. For example, I could do this:
K = 256_bit_random_key; while(not_end_of_file) { M = read_file(16 octets of the file); Y = M XOR K; transmit_message(Y); K = rotate_right(K); } transmit_message(K);
This is a very simple and extremely fast encryption algorithm that willactually transmit the session key at the end. So, there is no secrethidden, but it avoids the need of exchanging encryption keys some other way. This is not a flawless procedure, though. In fact, this is very easy to "crack", since we're just XORing the bits of the input stream with the key and then rotating the key 1 bit after each message. If an input stream had a number of 0s, then the key would appear as repeated data blocks that can be easily identified by a cryptography person. Heck, even I would recognizethe pattern. But, this might still be complex enough to avoid real-time detection of pirated content. But, I could just as easily use AES and *really* encrypt the media and place the 256-bit initialization vector and key at the end of the transmitted media stream :-)
Anyway, like I said-- I have no objections to studying this. I just am just a firm believer that it will only stop the casual pirates: those who want to use gmail.com or something to send an e-mail containing music. It would not stop software designed for secure file exchange, either person- to- person or across file sharing networks.
My primary concern now is with the wording in the report. Rong felt it was OK, but I still think it's wrong. It says: "... if SATC finds a data stream carrying content failing to match a known signature, it will block the stream or mark it as being suspicious.”> > Worded another way, this sentence says: "if the data stream contains an unknown data stream, block the data stream."
Now that would actually work to block pirated content, because you essentially only let known data through the network. But, I don't thinkthat is the intent, and it's definite not what the AVD document said.
Paul
-----Original Message----- From: Noah Luo(罗忠) [mailto:noah@huawei.com] Sent: Thursday, September 11, 2008 5:46 AM To: 'Paul E. Jones' Cc: zou.rong@huawei.com Subject: 答复: [itu-sg16] Q22 Question - SATC
Dear Paul Thank u very much for your good analysis and insightful comments on this specific aspect SATC.I will work together with Rong and come up with a solution as to how the Q22 meeting report will be appropriatedly reworded.Based on C3541 itself,actually, Q22 meeting report just faithfully reflects the basic idea of the scheme described.But if we also take into account the supplementary information Rong provided in his mail to you, then Q22 report may need incorporate some more information.
As for your questions regarding the feasibility of using signature> > > matching to block pirated contents transmission.I agree that we need more careful consideration.I think it will be ideal if more contributions will be submitted by Rong to the next SG16 meeting so that we can have more extensive discussion.My personal feeling is that it is a rather innovative idea to use this kind of techniques to block illegally distributed> > > contents, but there may still lack some feasible technical points to enable this idea to be transformed into a practical solution.
Paul, do you think this arrangement will be okay?
Best Regards
Noah
华为技术有限公司 huawei_logo
地址:深圳市龙岗区坂田华为基地 邮编:518129 http://www.huawei.com -------------------------------------------------------------
----- --------------------------------------------------------- 本邮件及其附件含有华为公司的保密信息,仅限于发送给上面地址中列出的个人 或群 组。禁 止任何其他人以任何形式使用(包括但不限于全部或部分地泄露、复制、或散 发)本 邮 件中 的信息。如果您错收了本邮件,请您立即电话或邮件通知发件人并删除本邮件! This e-mail and its attachments contain confidential information from> HUAWEI, which is intended only for the person or entity whose address is
use of the information contained herein in any way (including, but not
total or partial disclosure, reproduction, or dissemination) by persons other
intended recipient(s) is prohibited. If you receive this e-mail in error,
phone or email immediately and delete it!
-----邮件原件----- 发件人: itu-sg16-bounces@lists.packetizer.com [mailto:itu-sg16-bounces@lists.packetizer.com] 代表 Paul E. Jones 发送时间: 2008年9月11日 10:48 收件人: 'zourong 52447' 抄送: itu-sg16@lists.packetizer.com 主题: Re: [itu-sg16] Q22 Question - SATC
Rong,
I think the meeting report is not exactly correct. The Q22 meeting> > > report said that SATC would block content that that failed to match a signature. What you said is that SATC would block media content that matches a "good" signature, but is apparently altered in some way. That is, it would block media that it can clearly identify as pirated content.
So, perhaps we should revise the wording that is presently in
meeting report related to AVD-3541. Can you consult Noah on what the correct statement should be?
As for whether this will work, I still have doubts. Even storing> > > signatures of all current media could be a massive database and, quite
that would need to be in memory, as you need real-time (and likely wire- speed) access to the signatures.
I have no objection to doing work on this: if you can meet a
match against signatures will not stop any serious pirate. It might stop the casual pirate, though.
A data stream of pirated content can be altered in so many ways
would be easy for somebody to create a program to stream pirated content that does not match a signature in the database. I could run
stream through a cipher algorithm, for example. And, I could even transmit the cipher key as the last payload of the message: after all, the entire file has been delivered! Secrecy wasn't the purpose of the encryption,> but merely to disguise what is being transmitted until we've successfully> bypassed the SATC "detectors".
Perhaps the simplest thing to do, actually, given the wide proliferation of tools like OpenSSL, is to just use TLS between nodes that want to transmit pirated content. That provide more-than-acceptable level of encryption.
In any case, I did not intend to bring the work to a halt, by any means. But, be cognizant of the fact that this is a very complex
seriously doubt you can devise a method that would prevent a good> > > hacker from getting around the system. At best, you will only stop the casual> "pirates". You would not stop serious pirates or popular "file> sharing" software. Such software would definitely employ techniques to get around any detection/blocking logic in the router/switch.
What's more important right now is understanding what should have been stated in the meeting report. I do not think this sentence is accurate:
“... if SATC finds a data stream carrying content failing to match a known signature, it will block the stream or mark it as being suspicious.”
Paul
-----Original Message----- From: zourong 52447 [mailto:zou.rong@huawei.com] Sent: Wednesday, September 10, 2008 4:05 AM To: paulej@packetizer.com Cc: itu-sg16@lists.packetizer.com Subject: [itu-sg16] Q22 Question - SATC
Paul, really thanks your's comments, here is the clarification. 1) As to the circumventing problem, I really could not say
a perfect way that could not be circumvented. But I don’t
easy to do so. This scenario is attempted to provide a tool for content provider to prevent their “specially processed” content, i.e. copy- righted content from being pirated. It is assumed the “specially> > > > processed” content have some characteristic information that would be destroyed by any piratical actio n such as using a different decoding method, and these characteristic> > information or “signature” would be provided to SATC system by content provider. As to the compress scheme to circumvent, I think the SATC> > could handle most of the common compress schemes such as winzar,> > winzip, tar etc., but I also think it is impossible the SATC could handle all compress schemes. 2) The method mentioned in AVD-3541 is not attempted to create a database of all pirated media, it’s a database that store the signature of good media especially those most popular video or audio> > recently, so the database will keep in a state of acceptable size. The periodical updating of the database is necessary just like the IDS do to update the anti-virus database. 3) The pirated media database could exist as an subsidiary database.> > This was not mentioned in the AVD-3541 because I
detail to describe it in a scenario. The file transmitted will be suspected as pirated file when satisfied following two points: (1) There is a item in the “good media database” that means the file is a target of protecting. (2)the signature of the file transmitted do not match with that stored in the data base. Then the suspected file could be blocked directly, or in a more prudent way, it could be just marked as suspicious and search in a subsidiary “pirated media database” to make sure it is a existing> > > > pirated version or it maybe a new pirated version and need more manual analytical works of content provider. 4) As to the performance problem, I think this kind of inspection may have some negative impact on the packet transmission but may not as terrible as thought, in fact, many DPI product in market can achieve> > 10G to 40G, even 80G throughput, their process capability is remarkable.
发件人: Paul E. Jones 发送时间: 2008-09-05 10:59:38 收件人: itu-sg16@lists.packetizer.com 抄送: 主题: [itu-sg16] Q22 Question - SATC
Q22 Experts,
While reviewing the Q22 meeting report, I was intrigued
by a
comment related to AVD-3541. This contribution proposes
can be used to block illegal content from being transmitted over the Internet by examining the media flow and comparing that media> > > flow (or some signature thereof) against a database. SATC would then block content as appropriate.
I would like to comment: 1) I could easily circumvent any such measure, so any such system would prove to work only temporarily; and 2) While a given copy of a digital copy of some content, such as music or a movie, will have a clearly recognizable signature, the same media can be “substantially” altered by merely using a different encoding method (e.g., a different video codec or compression scheme); and 3) Actually attempting to create a database of all known> > > > > pirated media, including all variations, would result in the creation of a massively huge database; and 4) Maintenance of such a database would be a never- ending> > > > > chore; and 5) Notwithstanding the foregoing, while such a database could> > > be constructed and flows could be examined, inspecting flows in real-time and trying to positively identify a given flow is a monumental task that will result in significantly negative impact> > > on packet transmission performance, even if a bank of specialized> > > network processors were employed per router or switch
In short: do we really want to attempt this? While it’s all technically possible (at the risk of turning a 1Gbps pipe into a 1Kbps pipe), I would argue that it’s not going to work. I would be happy to be first in line to write the software necessary to circumvent such a system. And I would do it just because I can ;-)
Having said that, the meeting report said that, the meeting report> > > said that “if SATC finds a data stream carrying content failing to match a known signature, it will block the stream or mark it as being suspicious.”
I underlined the text of significance. This suggests
------ listed> > > above. Any limited to, than the please> notify the sender by the Q22 likely, one particular> business requirement, great. But, I personally think that trying to that it the media problem and I their is think it is think it maybe too that SATC that the
database would only be populated with signatures of known good media flows and would only block unrecognized flows. This seems different than what I read and understood from AVD-3541. What the meeting report suggests would be a simpler problem with a substantially smaller database. So, what was actually discussed and decided for SATC? A system that blocks unrecognized flows or a system that blocks recognized flows?
Thanks, Paul