*** Disclosure: The following is the output resulting from transcribing an audio file into a word/text document. Although the transcription is largely accurate, in some cases may be incomplete or inaccurate due to inaudible passages and grammatical corrections. It is posted as an aid to the original audio file, but should not be treated as an authoritative record.*** IDN Variant TLD Monday, 20 June 2011 ICANN Meeting - Singapore >> Ladies and gentlemen, we're about to begin our program. It is the IDN variant TLD. If you move forward in your seats, it would be much easier for our panelists and much more interesting for you. Thank you. >>KURT PRITZ: Hi. Welcome, everybody, and thank you for coming. Could you guys in the middle sit down or.... Hi. I'm Kurt Pritz. I am the senior vice president of services at ICANN. Thanks very much for coming to what is this session about very important work that ICANN is undertaking with a renewed vigor with the idea of delegating variant TLDs. It's very important now that IDNs have been delegated that we continue to work in order to provide a good user experience for those who use IDNs so that they can fulfill their potential in broadening the participation of the use of the Internet across the world. Variants, as the people in this room know, are a very difficult issue. Variant TLDs contain characters. Because those characters represent the same character in an alphabetic set, they can be very confusing. And so we have options right now. Either delegating variant TLDs that might result in user confusion and a poor user experience, or not delegating variant TLDs, and essentially disenfranchising a sector of that community that uses that alphabetic set. So it's really important that we do this work to resolve that conundrum so that we can delegate new TLDs that are variants and guarantee a good user experience. Now, there's been a lot of work on this in the past, several working groups. And the experiences from those working groups have taught you as few things. One is that when we investigate issues associated with variant TLDs, we make good headway, say in one script set, say Chinese, but then we think that really won't work for Arabic, and the discussion of trying to develop one set of issues and one set of solutions across all scripts has been difficult and problematic. And so members of the ICANN community have hit on a solution that, or a potential for solutions that involve the conducting of six separate case studies, one each for different scripts, and identifying the issues solely associated with those scripts. And that way, work can proceed on each script, a set of issues developed, and then later on, solutions developed for those scripts. We'll find that some issues, of course, are across all scripts but some issues are script specific. So we think this, conducting six different case studies for six different scripts, is a very good way to go about investigating what the issues are. ICANN's been very fortunate in attempting to recruit participants for these studies. I think we presently have 65 participants from 30 countries staffing these six different panels. And we also have the contributions of several host occurs that are contributing meeting space or a travel budget or Secretariat support, and other means for supporting this project. So in a very short period of time, we have a broad reach of participation across the community. And this meeting really represents the start of substantive work on that. Each one of the case-study teams has already met in developing a set of definitions and carrying on the future work, and each has made separate plans for how they are going to carry that work forward. So it's really my pleasure to introduce the people behind me. I'm flattered is exactly the right word. I'm so flattered that this set of people have come to work with ICANN. And when I say "ICANN," I mean the big ICANN community, not the bunch of staff members, to help solve this problem. And so I'm going to not introduce them all. I am going to, instead, introduce Dr. Dennis Jennings who is a former board member and is performing a lot of work in coordinating this group and providing leadership in getting the right people on board, organizing the work, and attracting host organizations such as we've had. So I want to thank you very much for coming, for your interest in this topic, and I want to start by thanking Dennis for taking over for me and starting the presentation. >>DENNIS JENNINGS: Thank you, Kurt. Thank you very much, indeed. If we could have -- Well, we have this first slide up. Could we go on to the next slide, please. So why this project? Well, it's been a longstanding request from many people in the community that we address this issue. In fact, it's been a longstanding request that we solve this issue. But when we look at it, we discover that there are many unanswered questions. And so the board instructed the CEO and the staff to develop an issues report on the subject. What do we mean by an "issues report"? Basically we mean a statement of requirements or a statement of the problem that we're trying to address. Now, a lot of people think that the problem is easily stated, and it usually is. It's "the problem is easy. Solve it for me." And it turns out when you look at the whole variant issues, even the definition of variants is not agreed across the set of scripts. So this project is focused on issues, not on solutions. So where are we with IDN variant TLDs today? Well, in the new program just announced -- and congratulations to the board and to Kurt on that achievement -- applicants may declare variant strings for the TLD in its application. Some work needs to be done, I think, in that area in the Applicant Guidebook, but that's the idea. But no variant TLD strings are going to be delegated until variant management solutions are developed and implemented. And the first step in this is to understand what the issues are, what the statement of requirements is, what the problem is so that we can then move on to develop solutions for IDN variants. So here's the scope of the project. First thing is to develop or create a glossary of terms that's been vetted with the communities. And that turns out to be an interesting challenge, which I'll talk about in a moment, because the terms are not precisely defined. They mean different things to different people, and it turns out they mean different things in different scripts. So that's the first thing, to create a glossary of terms that is consistent across the scripts that we're trying to address, and then to identify the challenges or the requirements, the problem statement based on linguistic accuracy, technical feasibility, usability, accessibility, security and stability, and so on. So the first proposal or the final proposal for this was published on the 20th of April. The initial proposal was published for this study. There was -- published for public comment. We got comment back. We're very grateful to the community for, given all the other things that were going on, to actually look at this and to give us comment. And as a result of that, of the comments, we added a Greek case study. We had originally proposed only five studies: Arabic, Cyrillic, Latin, Indic and Chinese. As a result of the public comments, we added a Greek case study, and we also refined what we had called the Indic case study, which is a misnomer because there isn't an Indic script, to be the Devanagari case study, a much tighter -- a single script, a much tighter definition. So we got a lot of positive feedback from the community, and that's the final proposal that was published. So the next step was we published a call for volunteers for the six case studies. We received over 70 applications across the six case study teams, and we did a lot of work to clarify those issues, and we've completed and published the selection of the team members. And um see that information later on in this presentation. And we selected the case study coordinators, the key people who will take responsibility for these six case studies, major projects in themselves within the project that we're undertaking. The teams are comprised of experts in a variety of relevant areas, as you can read on the slide up there. The DNS, operations, linguistics, security, policy, and, indeed, the IDNA protocol. They are led by the case study team coordinator, and they are going to have regular conferences, face-to-face meetings to do the work. And we in ICANN will provide support to that. In addition, we have bolstered our own team with some external experts, and there are two named there. Andrew Sullivan, who is a DNS and IDNA expert, who provides a lot of help on the technical side. And Andrew isn't with us. He is in Canada at the moment. But he did participate by phone on the various meetings we've had so far. And Nicholas Ostler, who is a linguistic expert who is with us and who has identified himself by raising his hand so that you know who he is. In addition, we have selected, and we're very pleased to have had a number of proposals for host organizations. As Kurt said, they will provide a variety of support from meeting space through to administrative support and, in some cases, even financial and travel support for the case study team, varying from case to case. And we're very pleased and we announced this, we published this the other day, that ICT Qatar has been selected, volunteer has been selected as the Arabic host organization. CDNC, the Chinese Domain Name Consortium, for the Chinese case study. UNESCO in Paris for the Cyrillic case study. The Department of Information Technology, the DIT, in India, and their department CDAC and NIC SI for the Devanagari case study. ICS-FORTH or FORTH-ICS in Crete for the Greek case study. And dot SE in Sweden or the Internet Infrastructure Foundation for the Latin case study. And we are really pleased to have these organizations roll in behind these studies. As you saw from the scope of work, the first thing to do is create a glossary of terms that everybody could agree with. And we have developed -- the team has developed with a lot of help from Andrew and from Nicholas a definitions document defining the terms being used. And that has been circulated and will be published. It's an initial document. We felt very pleased with it. We thought that we had this kind of nailed down, and it's interesting that much of the discussions on Saturday among the case study teams have indicated to us that there's quite a bit more work to be done on definitions, and the definitions may vary from case to case. But what we're trying to do is to make sure that we have a single set of definitions, and there may be slightly different definitions for different cases, but we are going to use words with a clear meaning. We want to have a clear glossary of terms. And if there is a slightly different definition for one case, we're going to use a different term for it so that it's clear, so that when we use terms, and hopefully when the community use terms, we'll all understand precisely what that term means. And that is certainly not the case at the moment, and I can tell you there is a lot of discussion you may have heard over the last couple of days where the word variant is bundled -- bundled -- is bandied around in a variety of fashions that are completely inconsistent with each other. The case study team feedback is required. That has been started. We started with two types of definitions from existing references and new definitions, and this is very much a work in progress, as I said. So we asked the case study teams to use the terms as defined. I already commented on that. Really, I've covered this slide in a variety of ways in what I've just said. And the goal of this and the restrictions that we're trying to apply is to minimize the difficulty later on during the harmonization step. I repeat, so that when we use a word, that everybody knows precisely what we mean by that word. So how do you guys in the community stay informed? Well, there's a mailing list that you can sign up for where there will be information posted regularly, and where you can participate. And there will be a community Wiki that you can sign up for, and the URL is there. And the work has started. The case study teams met -- what day is today? Today is Monday. On Saturday. I am already lost and it's only the first formal day of the ICANN meeting. On Saturday, we had a whole day meeting with the six case study teams, a couple of plenary sessions to work together, and individual sessions. And I'm delighted to say that all the teams got to work and did a lot of work, a lot of hard work there. I'm very impressed with the amount of effort that the teams are putting in. And the case study coordinator will present here a brief snippet of the work and the ongoing work and the plans for each of the case study teams. The work has only started. There will be lots of telephone conferences, there will be meetings. The schedules aren't yet -- or schedules as the Americans say -- aren't yet finalized but they will be published as the case study teams develop them. And the intent is, and we're holding to this, if we can, and it's up to the case study teams to try to deliver to this, that each case study team will develop an issues report by the 30th of September this year. Not the 30th of September next year. The 30th of September 2011, this year. That's challenging, particularly in the northern hemisphere where, as you know, particularly in Europe, lots of people go on holidays. It's also challenging in the Arabic script case study because Ramadan is in August this year, and I'm sure there are other challenges, but I know every case study is going to try to deliver the report by the 30th of September. And then our goal for the final issues report where case study coordinators will get together, probably in Senegal, in Dakar in Senegald, to try and see whether we can have a harmonized report or whether we have six individual reports with a report that highlights those things where there is agreement across the reports. That's due by the 15th of December. So now let's hear from the individual case studies on some of the work that's done. And is Sarmad Hussain here with us? Yes, indeed. Sarmad, would you like to take the microphone and given us a brief update. >>SARMAD HUSSAIN: Hello, everybody, and thank you, Dennis. I'd start -- So what I'm going to do is very briefly overview, introduce my team, the team for Arabic script case. It comprises of some very experienced people who have been working on Arabic script and Arabic script-based languages for some time. We have Abdulaziz (saying name )from Saudi Arabia, (Listing names). All these people bring a lot of experience in this area on the table, and that makes the work easy in a way because they are very familiar with the issues which exist with Arabic script. But also, you know, we understand the challenge because we have to actually close this work in a very short time. I'll just introduce some of the challenges which are relevant. So what I've done here is put the IDN ccTLD string for Pakistan at the top. And you can see that both strings which are shown are exactly the same visually, even though they use different code points from Unicode internally. And just to highlight the significance of why variant TLDs may be required for Arabic script. The second example is just a different word, it's not a TLD, but it highlights a different kind of problem, for example, where it's not that the different characters look the same way, but a combination of a combining mark -- excuse me -- with a letter looks exactly the same way as a letter which already is separately encoded in Unicode. And some of these cases, for example, are dealt with normalization in Unicode, but there are many such cases in Arabic script which are actually not dealt through Unicode normalization. So just to give you very briefly some examples of the issues we are dealing with as far as TLDs are concerned. Next slide, please. And here is a very brief but aggressive schedule which we are going to follow. And I must thank Baher for putting this slide together. And just to take you through this, we are actually going to take two different activities, the two different activities by the team. And I'll start with the issues list. So we are going to do two separate documents. We are going to work on the issues and work on definitions, as Dennis pointed out as well. We have already identified a first list of issues which -- yesterday, and we plan to go through and expand our case as much as possible through the e-mail list we have and try to close -- agree and close on the issues by the end of the week this week. And then based on those issues, we are going to draft a document with clearly -- which clearly explains those issues with examples. And what we are trying to do is not just come up with some representative examples but actually come up with an exhaustive list of those examples for each of those issues. And then eventually finalize -- once the document is prepared, finalize the document based on internal discussions and then hand that over to the IDN VIP team. In parallel, what we're going to do, though we initially started looking at the definitions document which was given to us by ICANN, very quickly on the first day we realized that it actually may not be possible for us to close the definitions without really looking at issues in more detail first. So what we're now going to do is finalize the issues first. And then, once the issues are taken care of, we'll come back to the definitions document. And first try to define the definitions from our context. And then look at how ICANN is defining these items or glossary items and see where we match and where we don't match, see what the differences are and whether they can be incorporated into the larger document. So that's what the brief plan is. We'll first do the document, issues document, then do the definitions document and then, hopefully, hand both things over to ICANN by the end of September. So thank you. >>DENNIS JENNINGS: Thank you very much, indeed. A very impressive piece of work. So you're well up and running. And thank you for that. I'd like to move on to the Chinese case study. Is professor Xiaodong Lee here? Xiaodong, yes. Would you give us the highlights of the work so far? >>XIAODONG LEE: This is Xiaodong Lee. I think I am the coordinator of the Chinese case study team. Can we move to the next slide? Yeah. So I want to give a very brief introduction about the case study team. You know, I'm very happy that we have the local host Chinese domain name. The CDNC is -- I've been working for Chinese domain name for many 10 years. I think there are so many memories from the Chinese- speaking area and also some from the Europe and Americans. They have so many discussions. So I think if we have the CDNC be the local host, it would be smooth, some case study work. Next slide. I'm going to just give you a very brief member highlight. You know, as the CDNC is local host, in the team members we also have the people from CDNC, the Professor Seng, Professor James Seng is the chairman of NIC and is also cochair of CDNC. And we also have Dr. Zhang. Also on behalf of the doc CNIC. And, you know, I think you have to remember that the speech (saying names) is the IDN working group chair. And James Seng is also the main contributor to the registry. Myself is the also contributor of the 4713. Also the fundamental document for the Chinese issues. And, you know, I also cochaired the -- I'm informal cochair of the IETF internationalized email address working group. But I'm very happy that we also have Professor Zhang is the main contributor of the Unicode CDCJK. And also the main contributor of the Chinese variant table. It was published in IANA. Lists variant table have been adopted by so many registries which provided the Chinese domain name registration. I think next slide? Also I'm very happy to have the team member Joseph Ye and also (saying name) Those two people are also the current IETF working group cochairs. Joseph Ye is co-chair of the international e-mail address. And Yoneya is also the cochair of the IETF working group. So these two working groups is the current active working groups which are involved in the internationalized issues, IETF. So we also have the representative from registries from dot CN, dot HK, dot TW, dot com, dot Asia. I think some of them operate the IDN TLDs. And we also have the community person from the communities from APTRD and also from Chinese domain name user in the Asian Pacific region. This case study team is distributed. We have team members from Asia Pacific region, but we also have members from Europe and American. So I think that this case study team will work together to finalize the issue report. I want to give some information about the past two days discussion meeting. At the request by ICANN, we have finalized the issue part of Chinese case study in the end of September. But we have a consensus in that case study team which should review, finalize the question list. And the -- in the end of July. And try to initial the issue report in August. And the first question of the issue probably should be released in the early of September. So it will give us much time to discuss and to get the review from other expert. I'm going to -- except for the team member of this case study, we also want to conduct the other expert from IETF or from other organizations to make sure the issue report is accurate and right. I think, you know, the Chinese variant issue is not a fresh new issue. The Chinese community has been working on the Chinese variant issues for over 10 years. Even in the very beginning of the IDN internationalized working group in IETF, the variant issue is a very important issue. So many discussions in IDN working group in IETF, how to do the symbol for Chinese and also other kind of variant issues for Chinese language. In the past 10 years since it was published, it's not a very perfect solution. But it is workable and acceptable by he Chinese community. And I also want to mention that now IETF is also working on some new solutions for the variant issues. It's not only for Chinese case. It's also maybe acceptable by other language. I also see that it's involved in this room. So -- but I want to clarify that that maybe two states now for the Chinese variant issue, it maybe found an acceptable solution and ways that the issue report. But in the next state maybe we can find more perfect solution in the IETF or other organizations. But the most important work for this case study team is to clarify the issues and finalize the issue product. This issue will be useful for the ICANN community. I think that all the members of this case study team work very hard and try our best to finish this work. Thank you. >>DENNIS JENNINGS: Xiaodong, thank you very much indeed. Excellent, and excellent to hear you have a plan to beat the deadline and get comment out early. Can I ask Alexey Sozonov who is coordinator for the Cyrillic to give an update? >>ALEXEY SOZONOV: Okay. Brief update of our Cyrillic case study group is we had a meeting and we understood that here is the scope of the broad definitions which are for all languages. And which need to be boiled down to the definition precise for Cyrillic languages and Cyrillic script. And just from the first meeting we acknowledged that we could not cover all Cyrillic scripts since it's too many languages and small languages. So we'll cover languages which is in the scope of the members of our group. And we'll have two experts, Sergei and (saying names) They are current authors of Cyrillic script 5992. And here is the list of the members of our group. Next, please. We also understood that we need broader community input during the discussion period on the Wiki to bring the issues that might be overlooked by us. As well we think it's a good idea will be to have communications with other coordinators and to share the working program because sometimes people look to things from different perspective and they bring up issues which was overlooked. We decided to meet once a week. And we might have a necessity to have a face-to-face meeting in the end of that work. And we are happy that our host organization is UNESCO. We didn't know about that. And next slide. Here it is. And, since we've been learning the current definitions, we understood that they were created for different languages and different cases. And most of them are much more complicated than Cyrillic requires. So we need to work on it together with an expert. And thank you. This is all update. >>DENNIS JENNINGS: Thank you very much indeed. Excellent. Could I ask Dr. Govind to speak about Devanagari -- I think I will get this word correctly pronounced one of these days -- to talk about the case studies. Thank you. >>DR. GOVIND: Thank you. Thank you for the opportunity to elaborate about the Devanagari, which supports major languages of the country. And thank you for the opportunity to get us into a team, which will be going into the various aspects of this IDN TLD variation. So our team will be consisting of ... (listing names). I will be taking time to elaborate what these persons like the community representatives will be that the Devanagari script will be (saying name.) The linguist will be consisting of Dr. Rajiv and Mrs. Shashi (saying names.) They have more than two decades of experience in the linguistic aspects of the Devanagari. Then we have registry/registrar operations. We have (saying names) and Kumar who belong to NIC SI and Afilias. They are technical service provider. They will be assisting us in the registry/registrar operations in the IDN TLD variations for Devanagari script. Then we have a policy part which will be done by (saying names.) We have taken from the diverse backgrounds, diverse agencies who will be assisting for this activity of IDN TLD. Then we have (saying names) and James Galvin from Afilias who will be assisting us. Then we have security and stability. Again, James Galvin and Amardeep, who is one of the third operations in the country. Then we have one of the directors in that. So, overall, this team comes from a rich background and a diverse knowledge of the aspects of the computers and the security aspects, variant aspect, and the script variation aspect. So all this, and community expertise is also there. So we expect that this composite team will help us to come out with a good study on this Devanagari script. Next, please. Some of the complexities of the script like Devanagari I will elaborate. I don't have the slide here. But it is like L and 1 in the Latin example a homographic variant. Like yesterday we talked about dra and dra. These two look like similar looking scripts. So the definitions need to be suitably worded to accommodate the nuances of complex script like Devanagari. So that is what we need to see is how the various complexities of the variant homographic variations in the Devanagari, how can we accommodate that in the definitions, which looks like more generic definition and which can take account of that variation. The word "register" will be changed to "located" and "activated" be changed to "delegated." That's what we discussed the last two days. And the character variant definition needs to be redefined preferrable variant dominant uses be considered. And 1:1 mapping of code in this variant is not the case in Devanagari. However, multiple code point sequence may have variance. And the language variant table is given what needs to be accommodated this kind of requirement. This will be elaborated in our discussions with the team from ICANN and give them the examples how this kind of -- why this makes sense Devanagari is not 1:1. It is one to many and many to one kind of mapping. We need to consider this aspect very carefully while defining these definitions and putting that in a proper structure. The case study group will produce a table containing an explanation of what they understand on the use definition and provide examples. That is what we discussed at each of the Arabic, Cyrillic, and the Chinese groups. If they can come up with a kind of table, what they understand from these definitions, which is given by ICANN on each of the IDN variant, and how each definition is supported by the examples. So that will help us to go more into the commonality between the various definitions and the examples of each script which we are studying here. The same should be shared with all the study group members. That is also we discussed. The coming to the timelines, what we discussed, since it's a complex script. So we -- we will have a weekly meeting teleconferencing planning to start from 27th of June, immediately after reaching our entry. And so that, on weekly conferences, we discuss the nuances of various complex structure of the Devanagari script and how we define the definitions and the examples. Then to come out with a overall structure of the Indian Devanagari script languages, we are organizing a conference on 6 of July in New Delhi with specific references to new gTLDs where we are inviting all the registrars, Internet community, the government officials, the industries, so that they also know. And we're inviting representative from ICANN also so that they can see how the Devanagari script and the Indian languages have a complexity how we can accommodate it. Face-to-face meeting planned in Pune. In one of the meetings since conference calls, we discussed how we can have a face-to-face meeting in Pune. And dates have also been finalized, the 21st and 22nd of July, where we can have a person from ICANN, IDN group, and the various stakeholders of the IDN in India. And we're organizing this meeting in Pune where the premiere institution of the language technology, CDAC, is based. And then we intend to finish this job by 30th of September, which is the timeline for completing that duty. Thank you. >>DENNIS JENNINGS: Thank you very much indeed. An extraordinarily complex case. And very impressed that so much work has already got started and so much planning done. I'd like now to ask Panagiotis Papaspiliopoulos to speak on the case study on behalf of Vaggelis Segredakis. Vaggelis, unfortunately, was injured in a car crash just after he accepted. I hope there's no association between accepting to be a case study coordinator and a car crash. But, unfortunately, he appears to be quite badly injured. And we send him all our very best wishes. So, Panagiotis, can you give us an update on where we with are the Greek case study. >>PANAGIOTIS PAPASPILOULOS: Thank you, Dennis. Yes. I'll speak on behalf of Vaggelis who had a serious accident two or three days before he was going to take a plane to come here to Singapore. Can I present you, please, the members of the Greek team? So Vaggelis is our coordinator. And he's based in Crete, where it's the registry of dot CR domain names. And they have a lot of experience there in handling the Greek letters. We have -- for your information, we have introduced the Greek letters and the dot CR for the second level domain six years now. And the variant case was examined technically and unilaterally at that time. And so we believe that we can offer and transfer our experience and our problematic to the ICANN project. And next member is Asimina (saying name) member of the regulator. And Catherine Tsapikidou is working for the ministry of foreign affairs. Evangelos (saying name) is an expert working for the Greek civilization organization and has involved many times in formulating relevant ISO standards, international standards, of course. And one of them is the Unicode standard. Fotia Panayiotou is a member of the dot CY registry. But CY is for Cyprus. Obviously, since there are two countries with the official language the Greek, Greece and Cyprus, here help would be valuable in order to have a common approach and common solution to the problems that might arise. George Papavlou is a member of the commission, of the telecommunication. It's the regulator. And he used to work for the commission, the European Commission. And maybe some of you may know him well, especially some people who follow ICANN meetings a long time. His experience is expected to be very valuable in this project. Giannis Papaioannou is a member of the heliographical service of the Navy. It's a linguistic expert and also an expert in geographical names. And he's also participating in several meetings around the world, and he's representing Greece in the relevant fora around the world. So we expect his help and his experience to be valuable as well. And it's myself. And I am a telecommunication expert. I'm working for the ministry of infrastructure transport and networks for the general secretariat of communications. So I'm nine years now. And we had cooperated with Vaggelis during the launch of the Greek -- for the second level domain and the dot CR six years ago. And I'm following ICANN almost nine years now. And I'm the GAC representative of Greece. So this is our team. And we hope to do our best to present something that this -- we all want to be useful, not only to the ICANN staff but also to the Internet community. And we have realized that, even if Greek is a language that not too much people speak, it's a language that too many people learn and maybe they want to use on the Internet as well. So, in coming to the project now, we would like first of all, to define and identify character set since Greek characters can be found in monotonical and polytonical systems. And it's set to tables in the Unicode standard that covers the set of Greek characters. So we have identified the set. For your information, all the characters are supported for the second level domain and the dot CR. And, after that, we want to present the variant issue and to provide to the definitions and to -- for the verification of variants to provide as many examples as it is necessary in order to identify the problems. Just a small brief, the variant can be two kinds of variants. One kind of the variants could be if you put the tone on a different vowel. So the word in capitals looks the same but, actually, it's not the same. And the capital is the problem. Because, otherwise, the encoding is different. And then the other case is when you have the same two words that look alike. In this case, we have realized from the very beginning that the -- this could be a field that we should cooperate also with the Latin and the Cyrillic team, since many of the Greek letters are also in the Latin alphabet and the Cyrillic alphabet. We believe that we have experience. And we are optimistic, even if our coordinator has a serious health problem, that we will meet the deadline and we will provide ICANN the full identification of the variant issue. And, since we have, as I told you before, some regulatory experience, even if it's not in the scope of the project of the states, maybe you could provide some information, some proposals for this as well. Thank you very much. >>DENNIS JENNINGS: Well, thank you very much, indeed. Can I call on Jothan Frakes, now, who is the case study coordinator for the Latin case. >>JOTHAN FRAKES: Thank you very much, Dennis. My name is Jothan Frakes, and I am a volunteer with the Mozilla project working with the application developer community on assimilating IDN into use for the general user community. And it's a privilege to have the opportunity to serve working with a very good volunteer team. If we could have the next slide, please. Working with Dr. Andrzej Bartosiewicz, (Listing names). We also have staff support from Kim Davies, helping us work together on some of the issues that exist in documenting those, capturing those, as well as definitions and around those with respect to the Latin script. We have determined, like many of the others who have spoken here today have talked about the definitions in and around the use of the term variant, what those mean within each of the language study teams. And we're working hard on defining what those are. And in fact, trying to determine some of those terms -- terminology and variants that do or don't apply, but at least documenting those. So the people who finally are the consumers of the end result documentation that we do are able to make well-informed decisions about the next steps. We're working with a language set -- excuse me, a script set that has a fairly good exposure in a very simplified form in that A through Z have been available for well over 25 years now in the DNS system. We're working to capture different elements of user experience and expectation and how those feed into the use of Latin characters as we extend end into diacritic marks and ligatures, et cetera, as we expand the Latin character sets that would be available in the Domain Name System at the top-level domain. We're trying to factor in things like what the user experience is with respect to their QWERTY keyboards, Q-W-E-R-T-Y for the transcriptor, as well as AZERTY, A-Z-E-R-T-Y. And these different layouts mean different things in different countries, in different uses. We are also taking a look at the display of characters and how that expectation works within applications and fonts for what the user experience is. An example of the user experience would be that in the current DNS system that whether you type a capital letter A or a lowercase letter A, the DNS system treats those identically. And we're working with some character sets and some language character repertoires that are fairly well structured using the Latin code sets -- or, excuse me, scripts from within the Unicode definitions. It's our objective to help document what we consider to be variants. And there are a variety of variant classifications, such as homographic characters as well as homophone strings, there's homographic strings as well as orthographic variants, and a number of other variants. So we're working to define variant so that we have a common vocabulary amongst our team that we can articulate to the other working groups. It we have also determined that we will have some exposure and need to interact with other teams such as the gentlemen to my right and left, Panos and Alexey, with the Greek and Cyrillic teams as well, because there is some overlap in those character sets as well. As far as the schedule goes, our team will be working with a biweekly schedule call. We are making a determination if the necessity of a face-to-face meeting will be there. We hope to ultimately make that determination by the end of July. Ultimately, we are pleased to have a very good team composition. We're already well under way with assignments, and we hope to work within the September 30th deadline with no issues. And that is my update. Thank you. >>DENNIS JENNINGS: Jothan, thank you very much, indeed. I realize that I didn't introduce a couple of key people from the project team, and I would like to do that now, and these are the case study liaisons. They are the staff on the team who are the key contact points and liaisons with the case study coordinators, and they are the people that you should know about on our side. And for the Arabic case study, Baher Esmat is the liaison. Baher is in the audience there. For the Chinese, Steve Sheng is the coordinator, who is on the panel up here. For the Cyrillic, Francisco Arias, who is beside me here, is the coordinator. From the Devanagari script, Naela Sarras is the liaison. For the Greek, Julie Hedlund, who is on the panel up here. And for the Latin, Kim Davies is -- is Kim here with us? >>KURT PRITZ: Kim is unfortunately across town at a DNSSEC event. >>DENNIS JENNINGS: That's right. He is at a DNSSEC signing ceremony but I think Kim is well-known to you all. I am going to open the microphones to questions from the floor in a moment, but before I do that, just a couple of things. When I started working on this project, I, like many other people, felt that this was a technical problem, and it should be thrown over the wall and a technical solution should be found. I've been rudely abused by my technical friends and colleagues that, in fact, it's not a technical problem. And there is no technical solution. And if there were a technical solution, it would take a long time to implement across the DNS, the hugely distributed database system called the DNS. So having been disabused of that, and you should note that because there isn't just a simple technical solution to this. And even if there were -- and some people say, well, DNAME, DNAME solves all the problems, I'm not sure the technical people would agree but even if they did, DNAME is not fully deployed across the DNS, so it would take time to deploy and to implement. So there ain't an easy solution here. There's a hard solution, which is to find out what the problem is and then to address that problem. It's probably at the registry, at the application layer. Maybe if we're very lucky, there will emerge a technical solution, but I personally now think that's unlikely. The other thing, and I said this at the beginning of the Saturday sessions, is I think we need to be conscious to reduce the complexity of the problem as much as we can so as to maximize the likelihood that some IDN variants -- some IDN variants will be delegated. The perfect is the enemy of the good. Peter Dengate Thrush mentioned that in relation to the resolutions that were agreed this morning. We can search for the perfect. We can search for solving all the issues, and we may find that if we try and do that, that we'll never get a solution. Now, having said that, I must remind everybody the first thing is to identify the issues, all the issues. But we should be mindful of categorizing those issues so that we can perhaps, in the next phase, find solutions for the most important ones that affect the largest number of Internet users. So let's have in back of our minds that we need to reduce the complexity, if at all possible. So with those sort of caveats to the project and to the community, I'd like to invite anybody who has a question or a comment to come up to the microphones and to address the panel. And you don't have to all rush at once. >>CHRIS DILLON: Hello. It's Chris Dillon from University College, London with a question to Dr. Govind about some Devanagari issues that came up a couple of days ago. And there was talk of there being problems with the Devanagari script because several languages use that script, and there was concern that they would be applying for identical strings and there would be major problems with variants. But I was just wondering if you could explain that in greater detail. >>DR. GOVIND: I will ask my colleague, Dr. Kulkarni, to explain that. >>MAHESH KULKARNI: The Devanagari, as you rightly said, is a script and there are a couple of languages which are schedule languages are being represented using Devanagari. The issue here is that the core page is the same while the languages are being derived from that. And we have done a study of the variants basically. We don't see any problem in relation with the -- basically the variants which are coming out across the languages. For example, if you want basically an IDN for Marathi, then automatically for the Hindi language, it will not be a given. So that's where we are trying to address that. And because from the end-user perspective, you don't have on the URL bar, basically, you don't have a language identification system. So Devanagari, as it appears as a script, and whose server applies first? Either the Hindi people or either the Marathi people or either the rest of the languages people, the first come, first served? This is what exactly we are plan. I hope I have answered your query. >>CHRIS DILLON: Yes. Now, one, just very quick, additional question. The Punjabi has a script, a Gurmukhi script, and is that regarded as being part of Devanagari or is that separate? >>MAHESH KULKARNI: No, actually, we have got 22 Indian languages, Shudhu languages. There are many more. And some of the languages have been represented by two different scripts. For example, if you take Punjabi, it has been written using Gurmukhi as well as using Shahmukhi. But Shahmukhi is like Arabic style of writing basically. But since it has not been currently incorporated into the Unicode, we are not taking that. So we will be considering only Punjabi, which is written in Gurmukhi script. >>CHRIS DILLON: And you would not use the word Devanagari to refer to -- >>MAHESH KULKARNI: No, no. They are totally two different scripts basically. We have got, for 22 Indian languages, we require something close to around ten different scripts are there to represent all the 22 languages. >>CHRIS DILLON: Thank you very much, indeed. >>DENNIS JENNINGS: Thank you, Chris. We have a questioner in the middle, please. >>ANDREW MACK: Thank you. Andrew Mack. To your discussion of the different languages that are -- the different scripts that are used for the similar languages, we've talked in the past about the idea of trying to bundle those together so that an applicant might be able to apply for two or more scripts to use -- that are used to express their language. And I was curious about your thoughts about that in the case much India. And also, what would the cost be of that? You know, so if you can add in a second -- what's the actual cost of that? Not the 185 but the actual technical cost of adding a separate script? >>DENNIS JENNINGS: Microphone. Dr. Govind, the microphone is off. >>DR. GOVIND: Ram would like to answer. >>RAM MOHAN: This is Ram Mohan. I guess I didn't understand when you said what is the technical cost of adding another script. You would simply instantiate -- if you were running a registry you would simply instantiate another script. And whatever the TLD label is, you would simply run it that way. Now, I don't think, so far, there has been much discussion about the combination of multiple scripts and calling those different scripts as variants of each other. I don't think we've got there yet. And I don't think that's what Dr. Kulkarni was talking about either. So if you look at, for example, the Devanagari example, you have Devanagari, and as Dr. Kulkarni was saying, you have Hindi and you have Marathi in there. The idea is, for example, Bharat, which is India in Hindi. If you want to get Bharat in Marathi, or if you want to get a second-level domain name in there, that's a different problem set all together. But if you would like to get a domain name, whatever, dot XYZ, in Devanagari and you want to get that same string, XYZ, in, say, Tamil, there is not a plan or a proposal to treat them as a bundle. Eventually, you would end up having two separate registries. You would end up running it through the registrars, and you would offer it as two unique sets of strings. So I hope that answers your question. >>DENNIS JENNINGS: Thank you, Ram, for that explanation. The word bundling, I've heard, has been bandied around quite a bit. Like all these terms, I'm not quite sure what it means. I think we need to define bundling, and then we need to work out whether this is just a self-serving statement of interest or whether there really is an issue here, and whether it belongs to our project or not. So yet another challenge to us to define bundling and to see what it means. Patrik, we have a question. >>PATRIK FÄLTSTRÖM: Patrik Fältström. I am chair of the SSAC, Security and Stability Advisory Committee. I just want to make one remark in the process we have here. I am really, really happy to see this project starting, let me start there, and I am also really happy with the quality of the discussion here at this session. I'm one of the authors of the IDN standards. I have been fighting with these kinds of problems myself for many, many years. But, for example, I hear that people at this session, one of the first time in the ten years I am working at this, I hear people really separate the definition of script and language. Well done. Okay. My remark regarding the process. As the chair of SSAC, I just want to make everyone understand that there are many, many individuals that are also members of SSAC, including staff and also volunteers that are part of this project. But just because we have that, I do envision in the future that this project might actually ask specific questions to SSAC. So I just want to point out to everyone that we don't have any liaisons formally between SSAC and this project, but we have individuals that participate in the project. So when we later might have questions that are sent over to SSAC or if it's the case that this group, for some reason, would like to have a formal liaison, because we at SSAC do very explicitly separate incoming and outgoing liaison relationships with other external groups at ICANN, in that case I am really happy to have that discussion and create those liaisons. Because I want these individuals to participate in these projects to be able to use their individual skills, their individual knowledges to participate fully in these projects and not feel they have to be bound neither now or in a potential future discussion inside SSAC when we try to answer some of these questions with the work that the previous have done. Thank you. >>DENNIS JENNINGS: Patrik, thank you very, very much for that and that's very helpful, indeed, and you and I should have a discussion off-line to flesh that out a little bit and see when that's appropriate to set up the liaisons, if and when. Thank you. I appreciate that. We have an online question, I believe. Francisco. Oh, Naela, you are going to take the question. Thank you. >>NAELA SARRAS: Thank you, Dennis. We have a question from Mr. Paco Fernandez. The question is can VeriSign now apply for new IDN gTLDs -- for example, Russian and Hebrew -- or would they be held up by this variant issue? >>DENNIS JENNINGS: And Francisco, do you want it take that question? >>FRANCISCO ARIAS: Sure. I believe the simple answer is yes, they can apply. They are not going to be held up. Anyone can apply for an IDN, would be able to apply for an IDN gTLD. What we are discussing in this project is the variants of IDNs. >>DENNIS JENNINGS: And if I may add to that, as I said at the beginning, if there is a variant, you may apply for a string and state a variant. If you look carefully at the Applicant Guidebook, um see that. I think there's some more work to be done on -- in that area. So if you have variant strings, although variant TLDs will not be delegated until this project has produced the issues report and then a subsequent project has looked at solutions, there is, nevertheless, the opportunity in the first round to apply for a TLD and to name a variant. Just to make that clear. Have we any other questions from the audience? Yes, we have a questioner. Please come up to the microphone and identify yourself. >> My name is Promot Pondhi (phonetic). I come from Jalalal (phonetic) University. This is my first ICANN meeting. A question I have is for the members in general. I wonder if we want to identify the issues of variation variants. If you can arrive at something like consensus of the classifications -- for example, looking linguistically at the Devanagari script, I can think of two clear sources of variants. One I would consider historical; that is, we have characters which stand for sounds which are no more there. But because, say, the language using it today -- for example, Hindi -- has relation with Sanskrit, so although the sounds have changed, but the letters have been kept intact. So you have examples, letters of a Gaelic ru (phonetic) kind of thing which is not pronounced. Shajitu (phonetic), you can write it in two different ways. The other has to do with the relevant orthographic level. I think of since Hindi, Devanagari is a syllabic script, so you have something like a syllable represented by one grapheme, so things like Chun in Chunden (phonetic) and Chum in Chumpeck (phonetic). The variation is on account of treating Chun as a phoneme, and so you have something called a dot Bindhu, but if it is not phoneme, if it is to do with surface representation, then you have the actual nasal consonants, like nah, mah (phonetic), et cetera. So the variations are on account of referring to different orthographic levels. So I can think in Devanagari, at least, of two different sources of variants, orthographic level. So these graphemes result -- variation results because they refer to two different levels, and historical sort of remnant sounds. So I wonder -- I believe the languages, scripts such as Latin and Greek may have similar problems. I have no clear idea. But some consensus, at least, something like sharing of discussion, dialogues, can lead to the consensus on types of -- sources of variants and solutions to them. Thank you. >>DENNIS JENNINGS: Thank you for that. Yes, one of the things that we have discovered is that we need to have lots of examples to clarify what we're talking about, and those examples are helpful. So there's a lot of work to be done here. Dr. Govind or Dr. Kulkarni, do you want to take an initial response to that question? >>MAHESH KULKARNI: Yeah, actually, what Dr. Pondhi (phonetic) was referring to, variants, basically, especially coming from two different streams. One is the variant coming from the phonetic variations which comes up and which we term as homophones. And there is another variation which comes from the -- what is called the way one looks at the screen, which means that homo graphs. Both aspects were considered and debated. We did this study for almost last four to five years, and we came to the conclusion that homophones -- treating homophones will be a difficult task and we might debar a lot of what is called the possible IDNs. And we have restricted ourselves only to the homographs. And the latest Unicode version, 6.0, caters to what is called the language nuances as well the script, which is properly represented, the languages which are properly represented using the Devanagari script. So I don't see we should, at least for the Devanagari, we should not treat homophones. We are only going like homographs, as is the case with the ASCII, like color and C-L-O-R and C-O-L-U-R, and we treat them differently. A similar case lies with Devanagari. Like Hindi, are written in two different forms. Both are acceptable forms. With Bindhi (phonetic), that is with Donaswar (phonetic), and with Nahallendar (phonetic). So these are two forms, but since they are not similar looking, we have taken a decision to keep them separated. So I hope that answers the question. >>DENNIS JENNINGS: Thank you. Any other case study coordinator want to comment on the -- Nicholas. >> Could I make a method lodge cam point about this. Obviously everything that has been said is absolutely correct. But as I understood the purpose of this activity, it is to identify potential cases, to generalize over them and to see what principles might be appealed to in future in adjudicating whether things are variants or not. We are not giving solutions at this stage, but the more examples we consider, the more powerful and effective and useful in the future the principles we arrive at will be. And of course you will find different principles in respect of different scripts. And probably there will be more principles to scripts we are not considering as well but that's for the future. Thank you. >>DENNIS JENNINGS: Thank you. Any case study coordinator want to comment on homophones rather than homographs? I think it's been very well covered by Dr. Kulkarni. So seeing no volunteer and seeing that we have no questioner on the floor, do we have any further questions from the -- excellent. With five minutes to go we can clear the room for the next session. Thank you very much, indeed -- oh, Ram, you have a question? >>RAM MOHAN: Not a question, actually. Just a comment. For very helpful suggestions like Dr. Pondhi said, Dennis, it would perhaps be useful to put up on the screen or someplace where these folks can actually continue to participate and provide input into the various case study teams because this is a community effort and what's need aside that kind of multiplicity of opinion. Whether or not, you know, having two -- considering two sources is a generalizable problem is one thing, but having that kind of input come in is really the critical core of what this entire effort is about. Thank you. >>DENNIS JENNINGS: Ram, thank you very much, indeed. You are absolutely right. This -- To emphasize, this is a community project. It's not an ICANN project. It's a community project. The case study teams are community teams. And there's an opportunity for everybody in the community to participate. And Francisco has brought back up on the screen the mailing list and the Wiki. The mailing list for participation, the Wiki for information that will be published. And it's our intent, while we will be having -- the case study meetings will be closed just for working purposes, the outputs will be published for everybody to see. And these are the mechanisms for people to ask questions, to raise questions, and to contribute. And we certainly hope, because I emphasize, these are community projects, not ICANN projects, that people, as we have seen, will ask questions and will contribute. For ICANN, I would like to say that I have been -- and I think I can say this as a consultant and not as a full staff member, that I have been tremendously impressed both with the case study teams and the work they're doing and with the staff, and I think this project is in good hands. And with that positive note, ladies and gentlemen, thank you very much, indeed. [ Applause ]